Statistics Module
Statistics Module
1
Module Overview
This module introduces students to the fundamental concepts of data presentation and analysis in
management decision making. The approach will emphasize a practical management problem
solving approach using quantitative reasoning skills rather than the theoretical development of
statistics. Students should develop quantitative research techniques for research writing.
2
Distinguish the different Types of data
Tabulation of data
present data on Frequency distribution, Pictograms ,Pie charts ,Bar charts
Observe the general rule of graphical presentations
CHAPTER 6.PROBABILITY
Estimate probability
Explain the types of events
Describe Conditional probability
Use tree diagrams to calculate probability
3
CHAPTER 1: BASIC NUMERICAL CONCEPTS
1.0 Introduction
The skill of being able to handle numbers well is very important in many jobs. There may often
be need to handle numbers and express them clearly in numerical form – and this is part of
effective communication, as well as being essential to understanding and solving many business
problems.
Firstly, though, there is need to examine the basic processes of manipulating numbers
themselves. In this chapter, a review of a number of basic numerical concepts and operations
shall be done. It is essential to ensure that the basics are fully understood before moving on to
some of the more advanced applications in later Chapters.
1.1 Objectives
calculate ratios and percentages, carry out calculations using percentages and divide
a given quantity according to a ratio;
4
explain the terms index, power, root, reciprocal and factorial.
The number system we use everyday consists of ten figures (i.e. symbols 0 to 9) are used, we call
this number system the “base ten” system or “denary” system. This is illustrated in the
following table.
The number of columns indicates the actual amounts involved. Consider the number
289.Thismeans that, in this number, there are:
two hundreds (289)
eight tens( 289)
nine units (289)
Before we look at other number systems, we can note that it is possible to have negative numbers
which are less than zero. Numbers which are more than zero are positive. We indicate a negative
number by a minus sign (−). A common example of the use of negative numbers is in the
measurement of temperature – for example, −20°C (i.e. 20°C below zero).
We could indicate a positive number by a plus sign (+), but in practice this is not necessary and
we adopt the convention that, say, 73 means +73.
5
1.1.2 Roman number system
We have seen that position is very important in the number system that we normally use. This is
equally true of the Roman numerical system, but this system uses letters instead of figures.
Although we invariably use the normal denary system for calculations, Roman numerals are still
in occasional use in, for example, the dates on film productions, tabulation, house numbers, etc.
Here are some Roman numbers and their denary equivalent:
XIV 14 XXXV1 36
1X 9 LV111 58
Denary Binary
0 0
1 1
2 10
3 11
4 100
5 101
6 110
7 111
8 1000
9 1001
This system is necessary because computers can only easily recognise two figures – zero or one,
corresponding to components of the computer being either switched off or on. When a number is
transferred to the computer, it is translated into the binary system.
6
1.2 Numbers – Approximation and Integers
Every figure used in a particular number has a meaning. However, on some occasions, precision
is not needed and may even hinder communication. For example, if you are calculating the cost
of the journey, you need only express the distance to the nearest mile.
Suppose you are given the figure 27 836 and asked to quote it to the nearest thousand. Clearly,
the figure lies between 27 000 and 28 000. The halfway point between the two is 27 500. 27 836
is more than this and so, to the nearest thousand, the answer would be 28 000.
There are strict mathematical rules for deciding whether to round up or down to the nearest
figure:
Look at the digit after the one in the last required column. Then:
Thus, when rounding off 236 to the nearest hundred, the digit after the one in the last required
column is 3 (the last required column being the hundreds column) and the answer would be to
round down to 200.
When rounding off 3 455 to the nearest ten, the last required column is the tens column and the
digit after that is 5, so we would round up to 3 460.
By significant figures we mean all figures other than zeros at the beginning or end of a number.
Thus:
632 000 has three significant figures as the zeros are not significant;
630002, however, has six significant figures – the zeros here are significant as they occur in the
middle of the number.
7
We can round off a number to a specified number of significant figures. This is likely to be done
to aid the process of communication – i.e. to make the number language easier to understand and
is particularly important when we consider decimals later in the unit.
Consider the figure 2 017. This has four significant figures. To write it correct to three significant
figures, we need to drop the fourth figure i.e. the 7. We do this by using the rounding off rule –
round down from 4 or round up from 5 – so 2 017 correct to three significant figures becomes
2 020.
To write 2,017 correct to one significant figure, we consider the second significant figure, the
zero, and apply the rule for rounding off, giving 2,000. Do not forget to include the insignificant
zeros, so that the digits keep their correct place values.
1.2.3 Integers
These are simply whole numbers. Thus, 0, 1, 2, 3, 4, etc, are integers, as are all the negative
whole numbers.
By adjusting any number to the nearest whole number, we obtain an integer. Thus, a number
which is not an integer can be approximated to become one by rounding off.
Practice Questions
i. ten
ii. hundred
iii. one thousand
(a) 3 946
(b) 8 451
(c) 6 501
8
2. Round off the following figures to the nearest:
i. ten thousand
ii. hundred thousand
(a) 2 836
(b) 3 894
(c) 7 886
(d) 9 726
(a) 1 736
(b) 22 300
(c) 1 901
(d) 26 000
We can use a number scale, like the markings on ruler, as a simple picture of addition and
subtraction. Thus, you can visualise the addition of, say, three and four by finding the position of
three on the scale and counting off a further four divisions to the right, as follows:
9
0 1 2 3 4 5 6 7 8
This brings us to the expected answer of seven.
Similarly we can view subtraction as moving to the left on the scale. Thus, 6 minus 4 is
represented as follows:
0 1 2 3 4 5 6 7 8
This illustrates the sum 6 − 4 = 2.
Negative numbers can also be represented on the number scale. All we do is extend the scale to
the left of zero:
6 5 4 3 2 1 0 1 2 3 4 5 6
We can now visualise the addition and subtraction of negative numbers in the same way as
above.
Consider the sum (−3) + 4. Start at −3 on the scale and count four divisions to the right:
4 3 2 1 0 1 2 3 4
10
Therefore, (−3) + 4 = 1.
(Note that, here, we have shown the figure of “minus 3” in brackets to avoid confusion with an
addition and subtraction signs.)
The order of the figures of these calculations does not matter, so we can also write:
(−3) + 4 = 4 + (−3) = 1
3 2 1 0 1 2 3 4 5
This is the same as 4 − 3. In effect, therefore, a plus and a minus together make a minus.
With subtraction, the process is slightly different. To subtract a minus number, we change the
subtraction sign to a plus and treat the number as positive. For example:
4 − (−3) = 4 + 3 = 7
Consider the follow examples. (Draw your own number scales to help visualise the sums if you
find it helpful.)
7 − (−3) = 7 + 3 = 10
2 + (−6) = 2 − 6 = −4
11
Solve the following sums, using a number scale if necessary:
1. 7 + (−4) 2. (−4) + 7
3. 8 − (−6) 4. 6 + (−8)
1.4 DECIMALS
Decimals are an alternative way of expressing a particular part of a whole. The term “decimal”
means “in relation to ten”, and decimals are effectively fractions expressed in tenths, hundredths,
thousandths, etc.
Unlike fractions, though, decimals are written completely differently and this makes carrying out
arithmetic operations on them much easier.
Decimals do not have a numerator or a denominator. Rather, the part of the whole is shown by a
number following a decimal point:
the first number following the decimal point represents the number of tenths, so 0.1 is
one tenth part of a whole and 0.3 is three tenths of a whole;
12
the next number to the right, if there is one, represents the number of hundredths, so 0.02
is two hundredths of a whole and 0.67 is six tenths and seven hundredths (i.e. 67
hundredths) of a whole;
the next number to the right, again if there is one, represents the number of thousandths,
so 0,005 is five thousandths of a whole and 0.134 is 134 thousandths of a whole; and so
on up to as many figures as are necessary.
1 3 67 1
0 , 1= 0 , 3= 0 , 67= 0,001=
10 10 100 1000
45 6 382 81
0,045= 0,6 = 0,382= 3,81 = 3
1000 10 1000 100
We can see that any fraction with a denominator of ten, one hundred, one thousand, ten
thousand, etc. can be easily converted into a decimal. However, fractions with a different
denominator are also relatively easy to convert by simple division.
Performing addition, subtraction, multiplication and division with decimals is exactly the same
as carrying out those same operations on whole numbers. The one key point to bear in mind is to
get the position of the decimal point in the right place. Layout is all important in achieving this.
To add or subtract decimals, arrange the numbers in columns with the decimal points all being in
the same vertical line. Then you can carry out the arithmetic in the same way as for whole
numbers.
3,6
7,84
+ 9,172
13
Total 20,612
Similarly, subtraction is easy provided that you follow the same rule of ensuring that the decimal
points are aligned vertically. For example, subtract 18.857 from 63.7:
63,7
- 18,857
Total 44,843
However, it is important to get the decimal point in the right place. There is a simple rule for
this; the answer should have the same number of decimal places as there were in the question.
Consider the sum 40 × 0.5. The simplest way of carrying this out is to ignore the decimal points
in the first instance, multiply the sum out as if the figures were whole numbers and then put back
the decimal point in the answer. The trick is to put the decimal point back in the right place.
So, ignoring the decimal points, we can multiply the numbers as follows:
40 × 5 = 200
Now we need to put the decimal point into the answer. To get it in the right place, we count the
number of figures after a decimal point in the original sum; there is just one, in the multiplier
(0,5) and place the decimal point that number of places from the right in the answer:
20,0 or just 20
If the sum had been 0.4 × 0.5, what would the answer be? The multiplication would be the same,
but this time there are 2 figures after decimal points in the original sum. So, we insert the
decimal point two places to the right in the answer: 0.2
14
One final point to note about multiplying decimals is that, to multiply a decimal by ten, you
simply move the decimal point one place to the right. For example:
3,62 × 10 = 36,2
Likewise, to multiply by 100, you move the decimal point two places to the right:
This can be a very useful technique, enabling quick manual calculations to be done with ease.
However, when dividing by ten or by one hundred (or even by a thousand or a million), we can
use the technique noted above in respect of multiplication to find the answer quickly without
using a calculator:
to divide by ten, simply move the decimal point one place to the left – for example,
68,32 ÷ 10 = 6,832
to divide by one hundred, simply move the decimal point two places to the left – for example,
Fractions other than tenths, hundredths, thousandths, etc. can also be converted into decimals.
It is simply a process of dividing the numerator by the denominator and expressing the answer in
decimal form. You can do this manually, but in most instances it is easier to use a calculator.
3
into decimal form, you divide 3 by 5 (i.e. 3 ÷ 5).
5
15
Using a calculator the answer is easily found: 0.6
However, not all fractions will convert so easily into decimals in this way.
1
Consider the fraction
3
Dividing this out (or using a calculator) gives the answer 0.3333333, or however many 3s for
which there is room on the paper or the calculator screen. This type of decimal, where one or
more numbers repeat infinitely, is known as a recurring decimal.
In most business applications, we do not need such a degree of precision, so we abbreviate the
decimal to a certain size – as we shall discuss in the next section. To change a decimal into a
fraction, we simply place the decimal over ten or one hundred, etc, depending upon the size of
the decimal. Thus, the decimal becomes the numerator and the denominator is 10 or 100, etc. For
example:
36 9
0.36 = =
100 25
We noted above that certain fractions do not convert into exact decimals. The most common
examples are one third and two thirds, which convert into 0,3333333, etc. and 0.6666666, etc,
respectively. (When referring to these types of recurring decimals, we would normally just quote
the decimal as, say, “0,3 recurring”.) However, there are also plenty of other fractions which do
convert into exact decimals, but only at the level of thousandths or greater. For example:
2
= 0,2857142, or
7
11
= 0,6470568
17
16
This gives us a problem: how much detail do we want to know? It is very rare that, in business
applications, we would want to know an answer precise to the millionth part of a whole, although
in engineering, that level of precision may be critical.
Such large decimals also present problems in carrying out arithmetic procedures – for example, it
is not easy to add the above two decimals together and multiplying them does not bear thinking
about.
In most business circumstances, then, we limit the number of decimal places to a manageable
number. This is usually set at two or three decimal places, so we are only interested in the level
of detail down to hundredths or thousandths. We then speak of a number or an answer as being
“correct to two (or three) decimal places”.
For decimals with a larger number of decimal places than the required number, we use the
principles of rounding discussed earlier in the unit to reduce them to the correct size. Thus, we
look at the digit in the decimal after the one in the last required place and then:
for figures of five or more, round up the figure in the last required place;
for figures of four or less, round down by leaving the figure in the last required place
unchanged.
So, if we want to express the decimal form of two sevenths (0.2857142) correct to three decimal
places, we look at the digit in the fourth decimal place: 7. Following the rule for rounding, we
would then round up the number in the third decimal place to give us 0.286.
Now consider the decimal forms of one third and two thirds. If we were to express these correct
to two decimal places, these would be 0.33 and 0.67 respectively.
Working to only two or three decimal places greatly simplifies decimals, but you should note
that the level of accuracy suffers. For example, multiplying 0,33 by two gives us 0.66, but we
have shown two thirds as being 0,67. These types of small error can increase when adding,
subtracting, multiplying or dividing several decimals which have been rounded – something
which we shall see later in the course in respect of a set of figures which should add up to 100,
but do not. In such cases, we need to draw attention to the problem by stating that “errors are due
to rounding”.
17
Practice Questions
2. Change the following fractions into decimal form, correct to the number of decimal places
specified:
2 3
(a) to 2 d.p (d) to 3 d.p
5 8
3 7
(b) to 2 d.p (e) to 3 d.p
4 8
5 5
(c) to 3 dp (f) to 3 dp
8 6
3. Change the following decimals into fractions and reduce them to their lowest possible terms
18
1.5 Percentages
A percentage is a means of expressing fractions in parts of a hundred – the words “per cent”
simply mean “per hundred”, so when we say “percentage”, we mean “out of a hundred”. The
expression “20 per cent” means 20 out of 100. The per cent sign is %, so 20 per cent is written as
20%.
4
4% means four hundredths of a whole, or four parts of a hundred =
100
28% means twenty-eight hundredths of a whole, or twenty-eight parts of a hundred =
28
100
100
100% means one hundred hundredths of a whole, or a hundred parts of a hundred =
100
Percentages are used extensively in many aspects of business since they are generally more
convenient and straightforward to use than fractions and decimals. It is important, therefore, that
you are fully conversant with working with them.
Multiplication and division have no meaning when applied to percentages, so we need only be
concerned with their addition and subtraction. These are quite straightforward, but you need to
remember that we can only add or subtract percentages if they are parts of the same whole.
Take an example: If Peter spends 40% of his income on rent and 25% on household expenses,
what percentage of his income remains? Here both percentages are parts of the same whole –
Peter’s income – so we can calculate the percentage remaining as follows:
However, consider this example. If Alan eats 20% of his cake and 10% of his pie, what
percentage remains? Here the two percentages are not parts of the same whole – one refers to
cake and the other to pie. Therefore, it is not possible to add the two together.
19
1.5.2 Percentages and Fractions
Since a percentage is a part of one hundred, to change a percentage to a fraction you simply
divide it by 100. In effect all this means is that you place the percentage over 100 and then
reduce it to its lowest possible terms.
For example:
60 3
60% = =
100 5
24 6
24% = =
100 25
Changing a fraction into a percentage is the reverse of this process – simply multiply by 100. For
example:
3
× 100 =37½% or 37,5%
8
2
× 100 = 66,67%
3
To change a percentage to a decimal, again you divide it by 100. As we saw in the last section,
this is easy – you simply move the decimal point two places to the left. Thus:
11,5% = 0,115
Working the conversion the other way is equally easy. To change a decimal to a percentage,
move the decimal point two places to the right:
0,36% = 36%
20
To find a particular percentage of a given number, change the percentage rate into a fraction or a
decimal and then multiply the given number by that fraction/decimal.
15
£260 × = £39 or £260 × 0,15 = £39
100
To express one number as a percentage of another, we convert the numbers into a fraction with
the first number as the numerator and the second the denominator, and then change the fraction
into a percentage by multiplying by 100.
For example, what percentage of cars has been sold if 27 have gone from a total stock of 300?
27
× 100 = 9%
300
Do not forget that percentages refer to parts of the same whole and, therefore, can only be only
be calculated where the units of measurement of the quantities concerned are the same.
For example, to show 23 pence as a percentage of £2, we need to express both quantities in the
same units before working out the percentage. This could be both as pence or both as pounds:
23 0 ,23
× 100 = 11.5% or × 100 = 11.5%
200 2
There are occasions when we deal with percentages greater than 100%. For example, if a man
sells a car for £1,200 and says that he made 20% profit, how much did he buy it for?
120
The cost, therefore, is 100% To work out the cost, we need to multiply the selling price by :
100
120
Cost = 1,200 × = £1,000
100
To prove that this is correct, we work the calculation through the other way. The cost was £1,000
and he sold it at 20% profit, so the selling price is:
21
£1,000 + 20% of £1,000 = £(1,000 + 0.2 × 1,000) = £1,200, as given above.
Practice Questions
3
(a)
100
3
(b)
8
2. Change the following percentages into fractions and reduce them to their lowest possible
terms
(a) 2% (b) 8%
1.6 Ratios
A ratio is a way of expressing the relationship between two quantities. It is essential that the two
quantities are of expressed in the same units of measurement – for example, pence, number of
people, etc. – or the comparison will not be valid.
22
For example, if we wanted to give the ratio of the width of a table to its depth where the width is
2 m and the depth is 87 cm, we would have to convert the two quantities to the same units before
giving the ratio. Expressing both in centimetres would give the ratio of 200 to 87 (200:87).
Note that the ratio itself is not in any particular unit – it just shows the relationship between
quantities of the same unit.
We use a special symbol (the colon symbol : ) in expressing ratios, so the correct form of
showing the above ratio would be:
They should always be reduced to their lowest possible terms (as with vulgar fractions).
To do this, express the ratio as a fraction by putting first figure over the second and
cancel the resulting fraction. Then re-express the ratio in the form of numerator :
denominator. For example, consider the ratio of girls to boys in a group of 85 girls and 17
boys. The ratio would be 85:17. This can be reduced as follows:
Ratios should always be expressed in whole numbers. There should not be any fractions
or decimals in them. Consider the ratio of average miles per gallon for cars with petrol
engines to that for cars with diesel engines where the respective figures are 33½ mpg and
47 mpg. We would not express this as 33½ : 47, but convert the figures to whole numbers
by, here, multiplying by 2 to give: 67 : 94
Ratios are used extensively in business to provide information about the way in which one
element relates to another. For example, a common way of analysing business performance is to
compare profits with sales. If we look at the ratio of profits to sales across two years, we may be
able to see if this aspect of business performance is improving , staying the same or deteriorating.
(This will be examined elsewhere in you studies.)
23
Another application is to work out quantities according to a particular ratio. For example, ratios
are often used to express the relationship between partners in a partnership – sharing profits on a
basis of, say, 50 : 50 or 80 : 20.
Consider the case of three partners – Ansell, Boddington and Devenish – who share the profits of
their partnership in the ratio of 3 : 1 : 5. If the profit for a year is £18,000, how much does each
partner receive?
first add the terms of the ratio to find the total number of parts;
then find what fraction each term of the ratio is to the whole; and
finally divide the total quantity into parts according to the fractions.
For the Ansell, Boddington and Devenish partnership, this would be calculated as follows:
3
Therefore: Ansell receives of the profits;
9
1
Boddington receives of the profits; and
9
5
Devenish receives of the profits.
9
Therefore:
Ansell gets
3 1
× £18,000 = £6,000 Boddington gets × £18,000 = £2,000
9 9
5
Devenish gets × £18,000 = £10,000
9
Practice Questions
1. Express the following sales and profits figures as profit to sales ratios for each firm:
24
Firm Sales Profits
£ £
B 63 000 7 000
C 30 000 10 000
D 75 000 15 000
(a) X, Y and Z share profits in the ratio of 3 : 4: 5. Total profit is £60 000
2.0 Introduction
In this chapter the main focus will be a review of basic concepts by looking at algebra. Algebra
essentially involves the substitution of letters for numbers in calculations, so that rules and
procedures for carrying out mathematical operations which can be applied whatever the actual
numbers involved can be established. In this sense, then, algebra enables general expressions to
be developed and general results to be obtained.
Equations are also going to be introduced. An equation is simply a statement of equality between
two mathematical expressions, and is generally used to enable one or more unknown quantities
to be worked out. There are various rules and procedures which can be applied to equations to
help this process and these will be examined in full.
Finally, a brief look at formulae will be done. A formula is one particular form of generalised
mathematical statement, usually expressed in the form of an equation, which sets out a rule
which may be applied in particular situations. There are formulae which describe a great many
physical situations or mathematical problems.
2.1Objectives
When students have completed this chapter they will be able to:
25
outline the basic principles of algebra;
apply the basic arithmetic operations of addition, subtraction, multiplication and division
to algebraic notation, and simplify algebraic expressions by the process of collecting like
terms;
define equations and formulae and outline their uses;
find unknown quantities and values from simple equations by using transposition;
outline the principles of formulae and how they are constructed;
rearrange the terms in a formula to isolate different unknowns.
2.2 Definitions
2.2.1 Algebra
Suppose, though, that we substitute letters for the first two numbers, so that:
2=a
3=b
a+b=5
All that has happened is that we have replaced the numbers with letters. However, a number is a
specific quantity – for example, 5 is more than 4, but less than 6 – whereas a letter can be used to
represent any number. Thus, in the above expression, “a” could be 4 and “b” could be 1. We only
know that they are 2 and 3 respectively because we defined them as such before. The main
consequence of this is that algebra uses general expressions and gives general results, whereas
26
arithmetic (using numbers) uses definite numbers and gives definite results. Arithmetic is
specific, whereas algebra is general.
Suppose you have a piece of wood which is 7 metres long and from it you wish to cut a
piece 4 metres long. The length of the remaining piece is 3 m, calculated as follows:
7−4=3
This is a specific arithmetic statement relating to cutting a specific amount from this particular
piece of wood.
We could translate this into an algebraic expression by substituting letters for the specific
lengths:
x−y=3
This is now a general statement for cutting one length of wood from another to leave a piece 3
metres in length.
To find the area of a floor measuring 10 m long and 9 m wide, we multiply one
dimension by the other:
Substituting the letters “l” and “w” to represent the actual length and width, we can reformulate
the expression as:
Area = l × w
27
Again, this is a general expression which can be made specific by putting in particular values for
“l” and “w”.
The distance travelled by a train in 3 hours at a speed of 60 miles per hour is easily calculated as:
3 × 30 = 180 miles
s×t=d
Thus, using algebra – working with letters instead of numbers – allows us to construct general
mathematical expressions. This is not particularly helpful in itself, but is very important when we
come to consider equations.
Note that an equation has two sides. Here, they are “2 + 2” and “4”. The two sides must always
be in equality for the statement to be an equation.
If we now introduce the concept of algebra into this, we have the makings of an extremely useful
mathematical tool.
For example, given that a certain number multiplied by 3 is 6, we can write this as:
To save writing “the required number” (or the “ unknown” number), it is more convenient to call
it
3×x=6
Here are a few simple statements which you can easily write as equations for practice:
28
(a) a certain number is added to 4 and the result is 20.
(a) x + 4 = 20 (b) x × 4 = 20
(c) x − 4 = 5 (d) x ÷ 3 = 1
The point of constructing equations in this way is that provide a means by which you can work
out the unknown value (here, “x”). These examples are very simple and you can probably solve
them (i.e. work out the unknown) very easily. However, equations can get very complicated.
Don’t worry, though – there are a number of simple rules which can be used to help solve them.
There are a number of rules which can be used to help solve equations and we shall consider
these below in relation to equations which have only one unknown. In a later unit we shall
examine more complex equations – those with two or more unknowns – and consider additional
rules. However, these are based on the simple rules which we set out here.
29
This is the golden rule that you must always follow when handling equations and there are no
exceptions.
What you do to one side of the equation, you must also do to the other.
Remember that an equation has two sides and those two sides must always be in equality for the
whole statement to be an equation. So, anything we do to the equation must maintain this
equality. Another way of thinking about this is that an equation is like a balance. If the weights
of a balance are equal on both sides it is “in balance”. You can add an equal weight to each side,
or take an equal weight from each side, and it will remain “in balance”. Looking at the rule in
more detail, we can see a number of possible operations which may be carried out without
changing the equality of the equation.
(a) The same number may be added to both sides of the equation
For example:
x = 10
(b) The same number may be subtracted from both sides of the equation
For example:
x + 6 − 6 = 18 − 6
x = 12
(c) Both sides of the equation may be multiplied by the same number.
For example:
1 1 x
of a number = 10, or x = 10, or = 10
3 3 3
30
x
× 3 = 10 × 3 3x = 30 x = 30
3
(d) Both sides of the equation may be divided by the same number.
For example:
2 x 30
If we divide both sides by 2, we get: =
2 2
x = 15
You should have noticed that the object of such operations is to isolate a single unknown, x, on
one side of the equals sign.
Practice Questions
Simplify the following expressions, writing “no simpler form” if you think any cannot be further
simplified:
1. x − 7 = 21 2. x − 3 = 26
x
3. x + 9 = 21 4. = 11
5
x
5. =7 6. 3x = 30
8
2.4.2 Transposition
When you understand the rules in the previous subsection you may shorten your working by
using transposition instead. Transposition is a process of transferring a quantity from one side of
an equation to another by changing its sign of operation.
An added or subtracted term may be transposed from one side of an equation to the other
31
if its sign is changed from + to −, or from − to +
(a) x − 4 = 6
Transfer the 4 to the right-hand side of the equation and reverse its sign:
x=6+4
x = 10
(b) x + 6 = 18
x = 18 – 6 x = 12
Note that we can keep transposing terms from one side to the other until we have isolated the
unknown quantity.
x
(a) = 12
4
To get x by itself, move 4 across the = sign and change it from divisor to multiplier, i.e.:
x = 12 × 4 x = 48
(b) 2x = 20
To get x by itself, transpose the 2 and change it from multiplier to divisor, i.e.:
32
20
x= x = 10
2
24
(c) =6
x
In this case, we need to transpose x and change it from divisor to multiplier, i.e.:
24 = 6x
Then, to get x by itself, we transpose the multiplier (6) and change it to a divisor:
24
=x
6
x=4
Practice Questions
x 72
1. = 36 2. = 24
6 x
3. 3x + 3 = 21 4. 5x = 25
5. 2x − 3 = 15 6. 21 − x = 15
These equations are treated in the same way as the other equations we have met so far. We
simply keep transposing terms as necessary until we have collected all the unknown terms on
one side of the equation.
33
3x = x + 8 + 6
3x − x = 8 + 6 2x = 14
Finally, move the 2 to the right-hand side changing it from multiplier to divisor, and thus
2 x 14
isolating x on the left: =
2 2
x=7
Practice Questions
1
1. x + 3 = 8 − 2x 2. 2x = 24 – x 3. = 15 − x
2x
2
4. 5x − 3 = 27 + 2x 5. 18 + x = 5x – 2 6. = 24 + 6x
3x
2.5 Formulae
A formula is a mathematical model of a real situation.
When person borrows money, he/she has to pay interest on the loan (called the “principal”). The
amount of interest payable is determined by the annual rate of interest charged on the money
borrowed, taking into account the time over which the loan is repaid. If we want to work out the
amount of interest repayable on a particular loan, the calculation is as follows:
Interest = the principal multiplied by the percentage rate of interest, multiplied by the number of
time periods the interest is to apply. We could model this situation by using algebraic notation
and then show the whole calculation as an equation:
34
let: I = simple interest
P = principal
n = number of periods
r
then: I = P × ×n
100
This equation is the formula for calculating simple interest. It can be used for any simple interest
calculation and enables us to find the value of any one of the elements, provided that we know
all the others.
Thus, what we have done is express the real situation as a mathematical model using an algebraic
equation.
This particular formula is very important in financial studies and can be applied in many
situations.
A man runs at a certain number of km per hour for a certain number of hours, and then cycles at
a certain number of km per hour for a certain number of hours. What is his average speed?
35
Therefore, we can say he covers (xy + pq) kilometres in (y + q) hours.
xy+ pq
km per hour.
x+ y
Note that when we use a letter such as x to represent an unknown quantity, it only represents that
quantity and the units must be exactly specified. For example, we said “let x = the speed at which
he cycles (in kph)” – to have said “let x = the speed at which he cycles” would have been
inaccurate.
The last step is to test the formula by going back to the problem and check that it works by using
real number values in the formula through the process of substitution. We shall look at
substitution below.
As practice, see if you can work the following problem out by developing a formula. Note that
the unknown quantities have already been given letters.
A grocer mixes m kg of tea at x pence per kg with n kg of tea at y pence per kg. If he sells the
mixture at v pence per kg, what profit, in pence, does he make? Bring your answer to £s.
v(m + n) pence
36
Cost price = (mx + ny) pence
v ( m+ n )−(mx+ ny)
Profit = £
100
As we have seen above, formulae consist of letters (algebraic notation) and, sometimes,
numbers. These are combined together in particular ways to express the mathematical
relationships between the terms such that a particular result is obtained.
The numbers in a formula always stay the same and are known as constants. so, for example, in
the formula for simple interest, the constant will always be 100, no matter what the values given
to the P, r and n in a particular situation.
The letters are, on the other hand variables. The value that they have in working out the formula
in any particular situation may vary – it depends on the particular circumstances of the problem.
2.5.3 Substitution
We defined a formula as a mathematical model of a real situation. In order to use that model in a
real situation, we have to give the variables the values which apply in that particular situation.
Doing this is the process of substitution.
Turning back to the formula for simple interest, consider the following example: What is the
simple interest on £1,000 invested at 10% per annum (i.e. per year) for two years? First of all we
should restate the formula:
r
I=P× ×n
100
Now we substitute the values from the problem into the formula:
P = £1 000 r = 10 n=2
37
10
So: I = 1 000 × ×2
100
£1,000 × 10 × 20
£200
Finally, we should note that a formula can be rearranged to find a different unknown quantity.
So, for example, when considering simple interest, we may wish to know how long it would take
to earn a specified amount of interest from a loan made to someone else, at a particular rate of
interest.
The way in which the formula is expressed at present does not allow us to do that, but by using
the rules we discussed above in relation to simple equations, we can rearrange the formula so
that it does.
Prn
I=
100
To find n, we need to rearrange the formula so that n is isolated on one side of the equation.
We can do this by multiplying both sides by 100 and then dividing both sides by Pr
Prn
100 × I = ×100 100I = Pr n
100
Practice Questions
38
Rearrange the simple interest formula to allow the following calculations to be made:
1. To find P – the principal which would provide a certain amount of interest at a particular
rate of interest over a specified period.
2. To find r – the rate of interest which would have to be charged to earn a particular
amount of interest on a specified principal over a certain period of time.
By contrast, a quadratic equation is one that contains the square of the unknown number, but no
higher power. For example, the following are quadratic expressions in x:
4x2 + 7x
2x2 − x + 1
1 2
x − 2.
2
We can also have quadratic expressions in x 2. For example, 3x4 + 2x2 + 5 is a quadratic in x 2,
since it may be also be written as:
3(x2)2 + 2(x2) + 5, or
Note that these are examples of quadratic expressions. A quadratic equation is obtained by
making such an expression equal to a similar expression or to a number.
One of the features of quadratic equations is that, usually, they cannot be solved definitively.
There are generally two possible values for the unknown.
39
A second feature, reflecting this, is that when we draw a graph of the outcome of a quadratic
equation for all the possible values of the unknown, the result is a curve.
If an equation contains two unknown numbers, we cannot determine their values unless we are
told something more. For example, if x and y are two quantities and all we are told about them is
that their product is 12, we cannot tell either of their values x may be 3, in which case y must be
4; or x may be −½, in which case y must be −24, etc. This is true whatever the value of the
product, with the one exception that if xy = 0, then either x = 0 or y = 0.
If x = 0, y may have any value, and if y = 0, x may have any value. They need not both be 0, but
one of them must be, since the product of two numbers is zero only when one of them is itself
zero.
This important fact provides the easiest method of solving a quadratic equation, as is shown in
the following examples.
Example 1
x(x − 3) = 0
Here, the product of the two factors x and (x − 3) is zero. Therefore, one of the factors must be
zero:
either: x = 0 or: (x − 3) = 0
Further, if (x − 3) = 0, then x = 3
40
Thus, the equation is satisfied if x = 0, or if x = 3.
Having understood the principle of this procedure, we do not need to give it in full every time.
The following example simplifies the steps.
Example 2
The important thing to remember is that this applies only to a product the value of which is zero.
Therefore, the above method of solving an equation applies only when the RHS (right-hand
side) of the equation is zero, while the LHS can be expressed in factors.
Example 3
(x − 5)(x − 2) = 4
We must rearrange this so that the RHS is zero. First of all, we have to expand the product of the
factors:
x2 − 7x + 10 = 4
x2 − 7x + 6 = 0
(x − 6)(x − 1) = 0
x = 6, or x = 1
Check this in the original equation just to justify to ourselves that the method works:
41
(x − 5)(x − 2) = 4
If x = 6:
(6 − 5)(6 − 2) = 1 × 4 = 4
If x = 1:
(1 − 5)(1 − 2) = (−4)(−1) = +4
Practice Questions
1. x(x − 1) = 0 2. (x − 1)(x + 2) = 0
3. 3x(x + 1) = 0 4. 4x2 = 0
One of the reasons that the questions above were so easy is that the LHS of the equations were
already expressed in factor terms. Commonly, though, equations are not so simple – the factors
may not already have been found, or the terms are not already collected on one side of the
operation.
Collect all the terms on the LHS, so that the RHS is zero.
Factorise the LHS expression – the product of the factors is then zero.
Equate each factor in turn to zero, where possible.
Finally, give your answers as alternative values for x.
Work through the following two examples carefully to make sure that you understand the
process thoroughly.
Example 4
42
Consider the following equation:
7x2 + 23x = 60
First, collect all the terms on the LHS, so that the RHS is zero
7x2 + 23x − 60 = 0
(7x − 12)(x + 5) = 0
5
x=1 or −5
7
Practice Questions
1. 7 − x2 = 5 – x 2. x2 + 4x + 4 = 0
Since the method used above for solving a quadratic equation applies to all cases, even where no
real roots can be found, it can be used to establish a formula.
43
Consider the equation ax2 + bx + c = 0, where a, b and c may have any numerical value.
Rearranging the equation we get:
−b ± √ b 2−4 ac
x=
2a
This is an important formula. When using it, always write it down first, then write the values of
a, b, c in the particular case, paying great attention to the signs. It can be used to solve any
quadratic equation, although you should, use the factor method whenever possible.
Example 1
5x2 − 15x + 11 = 0
−b ± √ b 2−4 ac
x=
2a
where: a = 5, b = −15, c = 11
Therefore:
15± 2,236
x= x = 1,72 or 1,28 to two decimal places
10
Example 2
3x2 + 4x − 5 = 0
−b ± √ b 2−4 ac
x=
2a
where: a = 3, b = 4, c = −5
44
Therefore:
−4 ± 8,718
x= = 0,79 or −2,12 (to two decimal places)
6
Practice Questions
1. 2x2 + 13x + 4 = 0
2. 5x2 − 3x − 4 = 0
3. x2 − 5x + 2 = 0
4. 4x2 + 7x + 1 = 0
5. x2 − x − 1 = 0
x+y=7
x = 3 and y = 4
x = 2½ and y = 4½
45
To determine a particular pair of unknown values, then, we need more information. One way of
getting more information is to have another equation – for example:
x + y = 7 and 2x + y = 12
Therefore, to find two unknowns we need two equations, and to find three unknowns we need
three equations, etc. These groups of equations are known as simultaneous equations.
Note, though, that for equations to be simultaneous they must satisfy two conditions – they must
be consistent and independent. For example:
If two equations are true simultaneously, any other equation obtained from them will also be
true. The method of solution by elimination depends on this fact. By adding or subtracting
suitable multiples of the given equations, we can eliminate one of the unknowns and obtain an
equation containing only one unknown. Once we have done that, the resulting equation can be
solved using the methods we already know, and then the other unknown may be discovered by
substitution in the original equation. The following examples explain the method. Study them
carefully.
Example 1
3x + y = 11 (1)
x − y = −3 (2)
46
We can add the two equations together to derive a third equation which will eliminate y:
3x + y = 11
x − y = −3
4x = 8
6 + y = 11 y = 11- 6 y=5
We can check that the values we have determined for x and y are correct by substituting both in
equation (2):
x − y = 2 − 5 = −3
Note that eliminating one of the unknowns (in this case, y) was possible because both the “y”s in
the original two equations had a coefficient of 1 – i.e. the coefficients were equal. (The
coefficient is simply the factor or number applied to an algebraic term, so the coefficient of the
term 3x is 3.) If we want to eliminate one of the unknowns by adding or subtracting the two
equations, the coefficients of the unknowns must be equal.
Example 2
6x + 5y = −6 (1)
18x + 7y = 6 (2)
We want one of the unknowns to have equal coefficients, but unlike in Example 1, this is not the
case here. However, we can see that the coefficient of x in (2) is a multiple of the coefficient of x
in (1). To make the coefficients equal, we derive a changed version of the same equation by
multiplying (1) by 3 and then subtract (2) from this equation:
47
18x + 15y = −18
Note that, providing we apply the same proportionate change to each term in the equation (by
multiplying or dividing each by the same number), the equation has not been changed. It is not a
new, independent equation, but simply a different way of expressing the same equation.
18x − 7y = 6
8y = −24
y = −3
6x + (−15) = −6
By transposition, we get:
6x = −6 + 15 = 9
x = 1.5
Example 3
13x − 7y = 33 (1)
9x + 2y = 16 (2)
This time, there are no equal or multiple coefficients, so there is no obvious unknown term which
can be easily eliminated. We can, then, choose which unknown to eliminate and, to avoid large
numerical terms, we shall decide to eliminate y.
Where there are no equal or multiple coefficients, we need to change the way in which both
equations are expressed so that we can make the coefficients of one of the unknowns equal. To
48
do this here, we multiply (1) by 2 and (2) by 7, so that the coefficient of y in both equations is
14.
26x − 14y = 66
89x = 178
x=2
18 + 2y = 16
By transposition, we get:
2y = 16 − 18 = −2
y = −1
Important note: When working through the processes of transposition and substitution, it is not
necessary to write − (−7) and + (−15), etc., but be sure to pay special attention to all calculations
involving negative numbers.
Practice Questions
1. 3x + 4y = 1
9x + 5y = 10
49
2. 5x − 3y = 14
7x + 5y = 38
3. 3x − y = 8
x + 5y = 4
4. 2x − 11y = 23
7x − 8y = 50
3.0 Introduction
This chapter will detail the rate of exchange between different currencies and convert values in
one currency into another. Secondly discounts and commissions; calculation of them from given
information will follow.
50
Formulae for simple interest and compound interest, and how it is used them to calculate any of
the amount of interest payable (or receivable), the principal, the rate of interest or the time period
covered, given information about the other variables will also be explained.
Lastly an outline of the way in which gross pay is made up and the main deductions made from
gross pay in order to leave net or take-home pay will be done. Furthermore calculation of the
income tax payable on pay, taking into account non-taxable items and personal tax allowances
will also be detailed.
3.1 Objectives
After completing this Chapter students should be able to:
explain the rate of exchange between different currencies and convert values in one
currency into another;
explain discounts and commissions and calculate them from given information;
state the formulae for simple interest and compound interest, and use them to calculate
any of the amount of interest payable (or receivable), the principal, the rate of interest or
the time period covered, given information about the other variables;
outline the way in which gross pay is made up and the main deductions made from gross
pay in order to leave net or take-home pay; calculate the income tax payable on pay,
taking into account non-taxable items and personal tax allowances
You are, no doubt, familiar with various currency systems. They are invariably based on the
decimal number system – for example, America uses the dollar ($) and the cent, which is
equivalent to one hundredth of a dollar, France has the French franc (FF) and the centime, which
51
is one-hundredth of a franc, British currency comprises the pound sterling (£) and pence, which
are one-hundredth of a pound and Botswana uses the pula (P) and the thebe, which are one-
hundredth of a pula.
The values of the different currencies used to depend upon the amount of gold which the country
possessed, but now that we are no longer on the “gold standard”, their values depends on the
political and economic circumstances of the countries concerned. Such value is measured by how
much one unit of currency costs in another currency.
For example, one unit of British currency (£1) costs approximately 1,45 units of US currency
($1,44). This relationship of one currency to another is known as the rate of exchange.
These values vary from day to day and are quoted in most daily papers. The changes in value are
brought about by such things as changes in government, changes in government policies, the
national budget, the relationship between the value of exports and imports, etc. To give you some
idea of the scale of changes over time, before the Second World War the value of the dollar was
about 21p (i.e. $1 = £0,21), whereas now it is about 69p (i.e. $1 = £0,69). However, such large
changes usually only occur over a long period or because of some particular economic problem.
In general, the changes are very small, although they can take place quite rapidly. Even then,
although the changes may be very slight, their effects can be very large when you consider the
amounts of money involved – companies may have many millions of pounds sterling invested in
overseas currencies, and a small change in the value of those currencies can produce substantial
losses or gains, depending on the whether the value of the currency rises or falls against the
pound. The rate of exchange is expressed as a ratio. It shows the value of one unit of currency in
terms of another currency. Thus, one side of the ratio is always “1”.
British £:
£1 = 10,55 FF
52
£1 = $1,44
£1 = P 16
Note that, although it is a ratio, the rate of exchange is usually shown as “x = y”, rather than “x :
y”. It is this ratio which determines how much of one currency can be bought with another
currency, or how much a particular amount of one currency is worth in another.
Before we look at the relevant calculations involved in converting one currency to another, there
are two further points to note about the rate of exchange.
For any relationship between two different currencies, the rate of exchange can be
expressed in two ways – how much one unit of one currency is worth in terms of the
other, and vice versa.
Thus, the rate of exchange between the £ and the DM could be shown as:
£1 = 3,15 DM or
1 DM = £0,32
The rate of exchange is usually quoted in two forms – a selling price and a buying price.
The selling price determines the amount of foreign currency you will be sold for your
pounds, and the buying price determines the amount of pounds you will receive in return
for your foreign currency. (The difference between the two enables the bank or other
agency undertaking the exchange to make a profit.)
As an example of the two different prices, the rates quoted might be:
£1 = 10,6 FF 10,5 FF
£1 = 3,16 DM 3,14 DM
53
£1 = $1,43 $1,45
To find the value of a sum of money in one currency in terms of another currency, we could
develop a simple formula using the rate of exchange, as follows
To convert a certain amount of the first currency (say, £x) into the second currency (say $y), we
multiply the amount of the first currency by the rate of exchange. Thus the formula would be:
y=x×e
So, to convert £100 into $ at the rate of £1 = $1,44, we would substitute the following values into
the formula:
x = 100
e = 1,44
If we wanted to know much a particular sum of the second (say $y) currency would be worth in
the first (say £x) currency, we would have to rearrange the above formula to isolate “x”. To do
this, we simply divide each side of the equation by “e” – the rate of exchange:
y x ×e y
= x=
e e e
54
1. What is the cost in £ of a bill of exchange in Paris for 1 275 francs if the rate of exchange
is £1 = 9,90 FF?
The rate of exchange is expressed as 9.90 units of one currency is equivalent to 1 unit of
the other. So, FF is the second currency and £ the first, and “x” is the amount in £s and
“y” the amount in FF.
Substituting in the second formula (to find x) we get:
1275
a. £x = £ = £128,79
9 , 90
(Note that the actual result of the calculation is £128,78787, but we round up to express it
correct to two decimal places as required for currency.)
2. Find the value of £63,40 in Swiss francs if the rate of exchange is £1 = 3.90 SF
The rate of exchange is expressed as 3.90 units of the second currency (SF) is equivalent
to 1 unit of the first (£). So, again “x” is the amount in £s and “y” the amount in SF.
Substituting in the first formula (to find y) we get:
y SF = 63.40 × 3.90 = 246.26 SF
3. Convert $3,000 to pounds given that £1 = $1,45
The rate of exchange is expressed as 1,45 units of the second currency ($) is equivalent to
1 unit of the first (£). So, again “x” is the amount in £s and “y” the amount in $.
Substituting in the second formula (to find x) we get:
£x = £ = £2 068,97
Practice Questions
1. John and Paul ate a meal in a restaurant whilst on holiday in Corfu. The meal for two cost
4 256 drachma. Calculate the cost of the meal in pounds. The exchange rate was 266 Dr
to the pound.
55
2. The Smiths spent a holiday touring in France. While travelling they used 200 litres of
petrol which cost 4,7 francs per litre. The exchange rate was 9,8 francs to the pound.
(a) how much did the petrol cost them in pounds?
(b) what was the price per litre of the petrol in pence?
3. Before going on holiday to Germany and Austria, the Jones family changed £600 into
German marks. While in Germany they spent 824 DM and then changed the remaining
marks into Austrian schillings as they crossed the border. The exchange rate was 2,798
DM to the pound and 7,047 Sch to 1 DM. Calculate:
(a) the number of marks they received;
(b) the number of schillings they bought.
3.3.1 Discounts
A discount is a reduction in the selling price of an item, usually offered as an inducement for
quantity buying or for prompt payment. The discount is expressed as a percentage of the list
price (the list rice is the original price) and what the buyer actually pays is the net price – the list
price minus the discount.
Multiply the list price (or base, B) by the rate of the discount (R) and divide by 100. Thus:
R
D=B×
100
(a) A three-piece suite was listed as £720, but was sold at a discount of 15%. What is
the net price?
56
15
Discount = £720 × = £108
100
(b) A car was listed at £3,600 but was sold for cash less a discount of 20%. What was
the final selling price?
20
Discount = £3 600 × = £720
100
3.3.2 Commission
A commission is the amount of money paid to an agent or broker or salesman for buying or
selling goods or services. This is expressed as a percentage of the value of the goods/services
traded and is called the rate of commission.
To find the commission, we simply multiply the value of the goods/services traded (known as the
principal) by the rate of commission. Consider the following example.
Pauline obtained £6 400 of new business for the PR agency for which she works. She is paid
commission of 12½%. Calculate the amount due to her.
12, 5
= £6,400 × = £800
100
Commissions are also usually charged by banks, travel agents and bureau de changes to cover
their costs when carrying out a currency exchange. The commission on changing cash is
commonly £3 or 2% whichever is larger.
57
When going on holiday, most people take a limited amount of cash in the currency of each
country to be visited and take the remainder that they will need in the more secure form of
travellers cheques.
We shall consider fixed and variable elements in more detail later, but consider the following
example in relation to currency exchanges.
Using the rates of commission quoted above, and given that £1 = $1.47, how many dollars would
you get when converting:
In each case, we would need to work out the commission at both the fixed and the variable rate,
and then apply whichever is the larger.
(a) The commission payable on converting £100 into $ cash is either £3 or £2 (2% of £100).
£100 − £3 = £97
(b) The commission payable on converting £200 into $ cash is either £3 or £4 (2% of £200).
£200 − £4 = £196
58
£196 × 1.47 = $288.12
However, the commission payable on converting £200 into $ travellers cheques is either £3 or £2
(1% of £200). The fixed rate is the larger so that would be applied:
£200 − £3 = £197
Practice Questions
1. What is the price paid for a £310 washing machine with a discount of 10%?
2. What is the price paid for a £7 000 car with a discount of 15%?
3. An estate agent charges a commission of 1,5% of the value of each house he sells. How much
commission is earned by selling a house for £104 000?
4. Calculate the commission earned by a shop assistant who sold goods to the value of £824 if
the rate of commission is 3%?
5. A car salesman is paid 2,5% commission on his weekly sales over £6 000. In one particular
week, he sold two cars for £7 520 and £10 640. What was his commission for that week?
6. On a holiday touring Germany, Switzerland and Austria, a family of four decided to take the
equivalent of £900 abroad. The travel agent charges commission on each currency exchange at
the rate of £3 or 2% (whichever is the larger) for cash and £3 or 1% (whichever is the larger) for
travellers cheques.
(a) They changed £110 into Deutschmarks, £100 into Swiss francs and £120 into Austrian
schillings. How much of each currency did they receive after commission was deducted.
(b) After buying the foreign currencies, they changed as much as possible of the remaining
money into travellers cheques. The smallest value of travellers cheque which can be
bought is £10.
(ii) How much commission did they pay for the cheques?
59
(c) What was the total cost per person of the foreign currencies and travellers cheques?
Simple interest (SI) is calculated on the basis of having a principal amount, say P, in the bank for
a number of years, T, with a rate of interest, R. The simple formula to work out the amount of
interest your money will earn:
PRT
SI =
100
This means that the interest is found by multiplying the principal by the rate, then by the time,
and then dividing by 100.
As we have seen from previous chapters, we can also rearrange this formula to allow us to work
out any of the variables.
Thus:
60
(a) John had £16.40 in an account that paid simple interest at a rate of 9%. Calculate how much
interest would be paid to John if he kept the money in the account for 5 years.
The principal (P) is £16,40 the rate (R) is 9% and the time (T) is 5 years. So, substituting inthe
formula, we get:
16 , 40 ×9 ×5
SI =
100
(b) Calculate the length of time taken for £2 000 to earn £270 if invested at 9%.
Here need to work out T where P = £2 000, SI = £270 and R = 9. Substituting in the formula,
we get:
100× 270
= 1.5 years
2000× 9
Practice Questions
1. What is the simple interest on £250 over 3 years at 8,75% interest rate?
2. If you invest £6 750 at 8,5% per annum, how much interest will you earn and how much will
you have in your account after 4 years?
3. If you invest £5 000 at 9 25% per annum for 6 months, how much interest will you receive?
4. £10 800 was invested and at the end of each year the interest was withdrawn. After 4 years, a
total of £3 240 in interest had been withdrawn. At what annual rate of interest was the money
invested?
If an amount of money is invested and the interest is added to the investment, then the principal
increases and, hence, the interest earned gets bigger each year. Suppose you begin with £1 000
and the interest is 8% per year. The interest in the first year is 8% of £1 000 which is £80.
Adding this to the original principal, the amount invested at the end of the first year is £1 080.
61
Now, in the second year, the interest is 8% of £1 080 which is £86,40. The amount accrued at the
end of the second year is £1 166,50.
You can continue working out the amounts in this way. However, to find the amount at the end
of any year, you are finding 100% + 8% = 108% of the amount at the beginning of the year. So
you can multiply by 1,08 each time.
There is a standard formula which can be used to calculate compound interest, as follows:
( )
n
r
A=P 1+
100
(a) £3 000 is invested in an account paying 12% compound interest per year. Find the value of
the investment after 3 years.
62
( )
n
r
Alternatively, we could use the formula: A=P 1+
100
( )
n
r
Substituting these values in the formula gives: A=P 1+
100
( ) = £3 000 × 1,12
3
12
A=3000 1+ 3
100
A= £3 000 × 1,404928
£2000 is invested in a high interest account for one and a half years. The interest rate is 10.5%
per annum, and it is paid into the account every six months. Calculate the value of the investment
after this time and the amount of interest earned.
( )
n
r
Therefore, the rate of interest for 6 months = 5.25% A=P 1+
100
( )
3
5 ,25
Substituting these values in the formula gives: A=2000 1+
100
= £2,000 × 1.05253
= £2,000 × 1.1659134
Value of the investment after 1,5 years= £2,331.83 (to the nearest p)
Practice Questions
63
1. £6 000 is invested at 10% per annum compound interest which is paid
annually. How much is in the account after 3 years?
2. £2,000 is invested at 10% per annum payable every 6 months. How much is in
the account at the end of 1½ years?
3. One building society offers an interest rate of 8.5% per annum payable half-
yearly. A second offers 8.75% payable annually. If you had £2,000 to invest,
which savings account should you choose?
3.6 Annuity
An annuity is a contractual financial product sold by financial institutions that is designed to
accept and grow funds from an individual and then, upon annuitisation pay out a stream of
payments to the individual at a later point in time. The period of time when an annuity is being
funded and before payouts begin is referred to as the accumulation phase Once payments
commence, the contract is in the annuitisation phase
Annuities can be structured generally as either fixed or variable. Fixed annuities provide regular
periodic payments to the annuitant.
Variable annuities allow the owner to receive greater future cash flows if investments of the
annuity fund do well and smaller payments if its investments do poorly. This provides for a less
stable cash flow than a fixed annuity, but allows the annuitant to reap the benefits of strong
returns from their fund’s investments.
64
The present value of annuity formula determines the value of a series of future periodic
payments at a given time. The present value of annuity formula relies on the concept of time
value of money, in that one dollar present day is worth more than that same dollar at a future
date.
All employees receive a financial reward as payment for their labour. At its most basic, this is
paid in the form of either a salary or wages. Salaries are generally paid monthly and are based on
a fixed annual amount. Thus:
Annual Salary
Monthly pay =
12
For example, an employee with an annual salary of £18 000 would receive £1 500 every month
(£18 000 ÷ £1 500).
Wages are generally paid weekly and are usually based on a fixed rate for working a certain
number of hours. For example, many wage earners are required to work a fixed number of hours
in a week, and they are paid for these hours at a basic hourly rate.
Thus, an office clerk is paid £4 per hour for a 40 hour week. What is his weekly wage?
= £4 × 40
= £160
65
An employee can often increase a basic wage by working longer than the basic week – i.e. by
doing overtime. A higher hourly rate is usually paid for these additional hours, the most common
rates being time and a half (i.e. 1½ times the basic hourly rate) and double time (twice the basic
hourly rate).
Consider, for example, a factory worker who works a basic 36 hour week for which she is paid a
basic rate of £5,84 per hour. In addition, she works 5 hours overtime at time and a half and 3
hours overtime at double time. Calculate her weekly wage.
People who are employed as salespersons or representatives, and some shop assistants, are often
paid a basic salary or wage plus a percentage of the value of the goods they have sold. In some
cases, their basic salary or wage can be quite small, or even non-existent, and the commission on
their sales forms the largest part of their pay.
For example, a salesman earns a basic salary of £8,280 per year plus a monthly commission of
5% on all sales over £5,000. What would be his income for a month in which he sold goods to
the value of £9 400?
= £690 + £220
= £910
66
Practice Questions
1. Calculate the weekly wage of a factory worker if she works 42 hours per week at a rate of
pay of £5,25 per hour
2. If an employee’s annual salary is £18 750 how much is she paid per month?
3. A basic working week is 36 hours and the weekly wage is £306. What is the basic hourly
rate?
4. A trainee mechanic works a basic 37,5 hour week at an hourly rate of £4,90. He is paid
overtime at time and half. How much does he earn in a week in which he does 9 hours
overtime?
5. An employee’s basic wage is £6,20 per hour, and she works a basic 5 day, 40 hour week.
If she works overtime during the week, she is paid at time and a half. Overtime worked at
the weekend is paid at double time. Calculate her wage for a week when she worked five
hours overtime during the week and four hours overtime on Saturday.
6. An insurance representative is paid purely on a commission basis. Commission is paid at
8% on all insurance sold up to a value of £4 000 per week. If the value of insurance sold
exceeds £4 000, he is paid a commission of 18% on the excess. Calculate his total pay for
the 4 weeks in which his sales were £3 900, £4 500, £5 100 and £2 700.
3.7.2 Taxation
The total earnings made up in the above way is known as gross pay. Unfortunately, very few
employees actually receive all the money they have earned. Certain amounts of money are
deducted from gross pay and the pay the employee actually ends up with is the net pay or take-
home pay.
There are often other deductions – some being further national or local taxes on pay and others
being deductions made by the employer for work specific reasons (such as repayments of a loan
for a car or for a public transport season ticket).
67
Income tax is a major source of finance for the government and is used to fund all types of public
expenditure. The tax is a percentage of gross pay, based on the amount a person earns in a tax
year that begins on 6th April and ends on 5th April the following year. The % rate at which the
tax is applied varies according to the level of income. The amount of gross income on which tax
is payable (known as taxable pay) may be affected by the provision of certain allowances within
the tax system and also by certain elements of pay or deductions being exempted from tax ( non-
taxable items). The rates of tax applied to particular bands of income, and the levels of
allowances, are decided each year by the government as part of the Budget.
The UK system divides taxable pay into three bands of income. For simplicity, we shall consider
the bands and the rate of tax applied to each to be as follows:
0 – £2 400 10%
This means that, for a person with a total taxable pay of £15 000 in a year, the total tax liability
would be:
40% £28 001 and above £2 334 and above £539 and above
68
There are many possible allowances and non-taxable items which affect the level of taxable pay.
Here we shall consider just two.
The amount of money an individual is allowed to earn each year before he/she starts to pay tax.
This amount (sometimes called free-pay) depends on the individual’s circumstances – for
example, whether they are single or married, number of children, etc. If your personal
allowances are greater than your actual pay, then you would pay no income tax at all. The levels
of personal allowances are generally set as an annual amount, but this can be converted into
monthly or weekly amounts of “free-pay”.
The amounts paid by an individual into a pensions scheme, such as an employer’s occupational
pension scheme, are not taxable. Therefore, pensions contributions, where they are deducted
from pay by the employer, may also be deducted from gross pay before calculating tax. (Note
that this does not necessarily apply to any other types of deductions from pay. Where pensions
contributions are not paid through the employer, an adjustment may be made to a person’s
personal allowances to allow for them.)
Now we can develop a framework for calculating the amount of tax payable annually, using the
tax bands set out above.
Consider the following examples, using the bands and tax rates set out above.
69
(a) How much annual tax will be paid by someone who earns £18 600 per annum and has
personal allowances of £5 800? What is the monthly tax deduction?
= £18 600
= £12 800
= £3 804
(b) How much annual tax will be paid by someone who earns £43 000 per annum, has personal
allowances of £4,400 and pays pension contributions through his employer at a rate of 5% of his
annual salary.
= £9 252
70
Note that taxation questions are often set without reference to income bands with different
taxation rates applying, or without reference to deductions from gross pay or personal
allowances. Always answer the question using the information supplied.
For example:
How much tax will be paid each month by someone who earns £24,000 per annum if the
tax rate is 25%?
£ 24 000
Net taxable pay per month, based on the information supplied = = £2,000
2 000
Practice Questions
1. An employee is paid a salary of £24 492 per annum which he receives in 12 monthly
payments. He has personal allowances amounting to £4 800. If tax is payable at the rate
of 24%, calculate his monthly pay.
2. If, in the next Budget, the Chancellor changes the rate of income tax from 24% to 23%,
but wages remain the same, how much less tax will be paid by the employee in question 1
above.
3. Someone whose personal tax allowance is £4 800 is paid a weekly wage of £80. Would
this person have to pay tax?
4. An employee has a gross income of £23 000 and personal allowances of £5 000. She pays
£3600 tax. What is the rate of income tax?
5. A weekly paid worker receives a basic wage of £5,50 per hour for a 37,5 hour week.
Overtime is paid at a rate of time and a half for the first five hours, and at double time for
any additional hours worked. She also belongs to an occupational pension scheme,
making contributions at a rate of 6% of basic earnings only. As a single mother, she has a
personal taxation allowance of £5 850. Calculate her net pay in a week when she works
48 hours if the tax bands and rates are as set out above
71
CHAPTER 4: STATISTICS AND DATA ANALYSIS
4.0 Introduction
This Chapter is concerned specifically with statistics.
“Statistics” is a word which is used in a variety of ways and with a variety of meanings, but, in
whatever way it is used, it is always concerned with numerical information. There are two
particular meanings of the word namely:
The numerical facts themselves – for example, we talk of the “statistics” of steel
production.
The methods of analysing the facts – in this sense, “statistics” is the title of a subject like
“arithmetic” or “chemistry” or “physics”. Sometimes the subject is called “statistical
method”.
Thus, statistics is concerned with numerical facts about a problem or issue – the facts themselves
and methods of analysing them in order to solve or shed light on the problem or issue. The
numerical facts with which statistics is concerned are commonly called data. So, therefore, we
talk about the collection and analysis of data in respect of a particular problem.
The methods used for analysing data can be divided into two categories – descriptive and
inferential.
Descriptive methods simply manipulate the available data in order to convey information
about that data – often in the form of summaries or graphical displays, and usually for the
purpose of simplifying comprehension or appreciation of the facts. These methods are
concerned only with the available data, even if that data is only a part of a (possibly much
larger) whole.
Inferential methods use the available data in order to make general statements about the
whole from which that data is only a part – i.e. they allow us to infer something about the
whole from a study of just a part of it. Predictions and estimates are typical of the general
conclusions which may be made by using inferential methods.
72
4.1 Objectives
When students have completed this chapter they should be able to:
We can often make good use of the great amount of statistical data published by governments,
the United Nations, nationalised industries, chambers of trade and commerce and so on. When
using this method, it is particularly important to be clear on the definition of terms and units and
on the accuracy of the data. The source must be reliable and the information up-to-date.
This type of data is sometimes referred to as secondary data in that the investigator himself has
not been responsible for collecting it and it thus came to him “second-hand”.
By contrast, data which has been collected by the investigator for the particular survey in hand is
called primary data. The information you require may not be found in one source but parts may
appear in several different sources.
Although the search through these may be time-consuming, it can lead to data beingobtained
relatively cheaply and this is one of the advantages of this type of data collection. Of course, the
73
disadvantage is that you could spend a considerable amount of time looking for information
which may not be available.
Another disadvantage of using data from published sources is that the definitions used for
variables and units may not be the same as those you wish to use. It is sometimes difficult to
establish the definitions from published information, but, before using the data, you must
establish what it represents.
In this method the investigator collects the data him/herself. This method has the advantage that
the data will be collected in a uniform manner and with the subsequent analysis in mind.
However, the field that may be covered by personal investigation may, naturally, be limited.
There is sometimes a danger that the investigator may be tempted to select data that accords with
preconceived notions. The personal investigation method is also useful if a pilot survey is carried
out prior to the main survey, as personal investigation will reveal the problems that are likely to
occur.
When the field to be covered is extensive, the task of collecting information may be too great for
one person. Then a team of selected and trained investigators or interviewers may be used. The
people employed should be properly trained and informed of the purposes of the investigation;
their instructions must be very carefully prepared to ensure that the results are in accordance with
the “requirements” of the study.
If there are many investigators, personal biases may tend to cancel out. Care in allocating the
duties to the investigators can reduce the risks of bias. For example, if you are investigating the
public attitude to a new drug in two towns, do not put investigator A to explore town X and
investigator B to explore town Y, because any difference that is revealed might be due to the
towns being different, or it might be due to different personal biases on the part of the two
investigators. In such a case, you would try to get both people to do part of each town.
4.2.4: Questionnaire
74
In some enquiries the data consists of information which must be supplied by a large number of
people. The most convenient way to collect such data is to issue questionnaire forms to the
people concerned and ask them to fill in the answers to a set of printed questions.
This method is usually cheaper than delegated personal investigation and can cover a wider field.
A carefully thought-out questionnaire is often also used in the previous methods of investigation
in order to reduce the effect of personal bias.
The distribution and collection of questionnaires by post suffers from two main drawbacks:
The forms are completed by people who may be unaware of some of the requirements
and who may place different interpretations on the questions – even the most carefully
worded ones!
There may be a large number of forms not returned, and these may be mainly by people
who are not interested in the subject or who are hostile to the enquiry.
The result is that we end up with completed forms only from a certain kind of person and thus
have a biased sample. For this reason, it is essential to include a reply-paid envelope to
encourage people to respond.
If the forms are distributed and collected by interviewers, a greater response is likely and queries
can be answered. This is the method used, for example, in the Botswana Population Census. Care
must be taken, however, that the interviewers do not lead respondents in any way.
Primary data is data which is both original and has been obtained in order to solve the specific
problem in hand. Primary data is, therefore, raw data and has to be classified and processed using
appropriate statistical methods in order to reach a solution to the problem.
Secondary data is any data other than primary data. Thus, it includes any data which has been
subject to the processes of classification or tabulation or which has resulted from the application
of statistical methods to primary data, and all published statistics.
75
4.3.2: Quantitative and qualitative
Quantitative variables, to which we shall restrict discussion here, are those for which
observations are numerical in nature.
Qualitative variables have non-numeric observations, such as colour of hair, although, of course,
each possible non-numeric value may be associated with a numeric frequency.
A continuous variable may take any value between two stated limits (which may possibly be
minus and plus infinity). Height, for example, is a continuous variable, because a person’s height
may (with appropriately accurate equipment) be measured to any minute fraction of a millimetre.
A discrete variable, however, can take only certain values occurring at intervals between stated
limits. For most (but not all) discrete variables, these interval values are the set of integers
(whole numbers).
For example, if the variable is the number of children per family, then the only possible values
are 0, 1, 2, etc. because it is impossible to have other than a whole number of children. However,
in Britain, shoe sizes are stated in half-units, and so here we have an example of a discrete
variable which can take the values 1, 1½, 2, 2½, etc.
4.4.1: Homogeneity
76
Houses cannot be compared unless they are of a similar size or value. If the data is found not to
be homogeneous, there are two methods of adjustment possible.
(a) Break down the group into smaller component groups which are homogeneous and study
them separately.
(b) Standardise the data. Use units such as “output per man-hour” to compare the output of
two factories of very different size. Alternatively, determine a relationship between the
different units so that all may be expressed in terms of one – in food consumption
surveys, for example, a child may be considered equal to half an adult.
4.4.2: Completeness
Great care must be taken to ensure that no important aspect is omitted from the enquiry.
Each term used in an investigation must be carefully defined; it is so easy to be slack about this
and to run into trouble. For example, the term “accident” may mean quite different things to the
injured party, the police and the insurance company. When using other people’s statistics, for
changes in definition. Acts of Parliament may, for example, alter the definition of an “indictable
offence” or of an “unemployed person”.
4.4.4: Uniformity
The circumstances of the data must remain the same throughout the whole investigation. It is no
use, for example, comparing the average age of workers in an industry at two different times if
the age structure has changed markedly. Likewise, it is not much use comparing a firm’s profits
at two different times if the working capital has changed.
77
4.5 Tabulation of Data
Having completed the survey and collected the data, there is need to organise it so that useful
information can be extracted and then present our results. The data will very often consist of a
mass of figures in no very special order. For example, we may have a card index of the 3,000
workers in a large factory; the cards are probably kept in alphabetical order of names, but they
will contain a large amount of other data such as wage rates, age, sex, type of work, technical
qualifications and so on. If are required to present to the factory management a statement about
the age structure of the labour force (both male and female), then the alphabetical arrangement
does not help, and no one could possibly gain any idea about the topic from merely looking
through the cards as they are.
What is needed is to classify the cards according to the age and sex of the worker and then
present the results of the classification as a tabulation.
This covers only one aspect of the set of figures. The idea is best conveyed by an example.
Consider the card index of 3,000 workers mentioned earlier. Assume that each card carries the
name of the workshop in which the person works. A question as to how the labour force is
distributed can be answered by sorting the cards and preparing a simple table as follows:
78
4.6.2: Complex tabulation
This deals with two or more aspects of a problem at the same time. In the problem just studied, it
is very likely that the two questions would be asked at the same time, and we could present the
answers in a complex table or matrix.
This table is much more informative than are the previous simple table, but it is more
complicated. We could have complicated it further by dividing the groups into, say, male and
female workers, or into age groups.
So far, tables above have merely classified the already available figures – the primary statistics.
However, we can go further than this and do some simple calculations to produce other figures –
secondary statistics. As an example, take the first simple table illustrated from the previous page,
and calculate how many employees there are on average per workshop. This is obtained by
dividing the total (3 000) by the number of shops (5), and the table appears as follows:
79
Table 4.3: Distribution of workers by workshop
For another example, we may take the second simple table given above and calculate the
proportion of workers in each wage group, as follows:
The proportions given here are also secondary statistics. In commercial and business statistics, it
is more usual to use percentages than proportions – so in the above table, these would be 3,5%,
17%; 30,7%; 33,8%; 10% and 5%.
Secondary statistics are not, of course, confined to simple tables; they are used in complex tables
too, as in this example:
80
Table 4.5: Inspection results for a factory product in two successive years
The percentage columns and the average line show secondary statistics. All the other figures are
primary statistics.
Note carefully that percentages cannot be added or averaged to get the percentage of a total or
of an average. Percentages must be worked out on the totals or averages themselves.
Another danger in the use of percentages has to be watched, and that is that you must not forget
the size of the original numbers. Take, for example, the case of two doctors dealing with a
certain disease. One doctor has only one patient and he cures him – 100% success! The other
doctor has 100 patients of whom he cures 80 – only 80% success! You can see how very unfair it
would be on the hard-working second doctor to compare the percentages alone.
Try not to include too many features in any one table (say, not more than four or five) as
otherwise it becomes rather clumsy. It is better to use two or more separate tables.
Each table should have a clear and concise title to indicate its purpose.
It should be very clear what units are being used in the table (tonnes, £, people, £000,
etc.).
81
Blank spaces and long numbers should be avoided, the latter by a sensible degree of
approximation.
Columns should be numbered to facilitate reference.
Try to have some order to the table, using, for example, size, time, geographical location
or alphabetical order.
Figures to be compared or contrasted should be placed as close together as possible.
Percentages should be pleased near to the numbers on which they are based.
Rule the tables neatly – scribbled tables with freehand lines nearly always result in
mistakes and are difficulty to follow. However, it is useful to draw a rough sketch first so
that you can choose the best layout and decide on the widths of the columns.
Insert totals where these are meaningful, but avoid “nonsense totals”. Ask yourself what
the total will tell you before you decide to include it. An example of such a “nonsense
total” is given in the following table:
The totals (470) at the foot of the two columns make sense because they tell us the total
number of seats being contested, but the totals in the final column (550, 390, 940) are
“nonsense totals” for they tell us nothing of value.
If numbers need to be totalled, try to place them in a column rather than along a row for
easier computation.
If there is need to emphasise particular numbers, then underlining, significant spacing or
heavy type can be used. If data is lacking in a particular instance, then insert an asterisk
(*) in the empty space and give the reasons for the lack of data in a footnote.
Footnotes can also be used to indicate, for example, the source of secondary data, a
change in the way the data has been recorded, or any special circumstances which make
82
the data see odd.
If we now assemble and tabulate these figures, we can obtain a frequency distribution as follows:
A useful way of preparing a frequency distribution from raw data is to go through the records as
they stand and mark off the items by the “tally mark” or “five-bar gate” method.
First look at the figures to see the highest and lowest values so as to decide the range to be
covered and then prepare a blank table. Then mark the items on your table by means of a tally
mark.
83
To illustrate the procedure, the following table shows the procedure for the above example after
all 20 items have been entered.
Sometimes the data is so extensive that a simple frequency distribution is too cumbersome and,
perhaps, uninformative. Then we make use of a “grouped frequency distribution”. In this case,
the “length of time” column consists not of separate values, but of groups of values:
Grouped frequency distributions are only needed when there is a large number of values and, in
practice, would not have been required for the small amount of data in our example.
Table 4.8 shows a grouped frequency distribution used in a more realistic situation, when an
ungrouped table would not have been of much use.
84
Table 4.9: Age distribution of workers in an office
The various groups (for example, “26 – 30”) are called “classes” and the range of values covered
by a class (five years in this example) is called the “class interval”. The number of items in each
class (for example, 28 in the 26 – 30 class) is called the “class frequency” and the total number
of items (in this example, 220) is called the “total frequency”.
The term “class boundary” is used to denote the dividing line between adjacent classes, so in the
age group example the class boundaries are 16, 21, 26, .... years. In the length of time example,
as grouped earlier in this section, the class boundaries are 3.35, 3.55, 3.75, 3.95 minutes. This
needs some explanation. As the original readings were given correct to one decimal place, we
assume that is the precision to which they were measured. If we had had a more precise
stopwatch, the times could have been measured more precisely. In the first group of 3.4 to 3.5
are put times which could in fact be anywhere between 3.35 and 3.55 if we had been able to
measure them more precisely. A time such as 3.57 minutes would not have been in this group as
it equals 3.6 minutes when corrected to one decimal place and it goes in the 3.6 to 3.7 group.
Another term, “class limits”, is used to stand for the lowest and highest values that can actually
occur in a class. In the age group example, these would be 16 years and 20 years 364 days for the
first class, 21 years and 25 years 364 days for the second class and so on, assuming that the ages
were measured correct to the nearest day below. In the length of time example, the class limits
are 3.4 and 3.5 minutes for the first class and 3.6 and 3.7 minutes for the second class.
85
4.9 Choice of Class Interval
When compiling a frequency distribution make the length of the class interval equal for all
classes so that fair comparison can be made between one class and another. Sometimes,
however, this rule has to be broken – for example, some grouped frequency distributions bring
together the first few classes and the last few classes because there are few instances in each of
the individual classes.
In such cases, before using the information, it is as well to make the classes comparable by
calculating a column showing “frequency per interval of so much”, as in this example for wage
statistics:
These intervals let you see how the last column was compiled.
A superficial look at the original table (first two columns only) might have suggested that the
most frequent incomes were at the middle of the scale, because of the appearance of the figure
55 000. But this apparent preponderance of the middle class is due solely to the change in the
length of the class interval, and column three shows that, in fact, the most frequent incomes are
at the bottom end of the scale, i.e. the top of the table.
86
Remember that the purpose of compiling a grouped frequency distribution is to make sense of an
otherwise troublesome mass of figures. It follows, therefore, that there should not be too many
groups or we will be little better off; nor do we want too few groups or we will fail to see the
significant features of the distribution.
Very often we are not specifically interested in the separate class frequencies, but in the number
of items above or below a certain value. When this is the case, we form a cumulative frequency
distribution as illustrated in column three of the following table:
The cumulative frequency tells us the number of items equal to or less than the specified value,
and it is formed by the successive addition of the separate frequencies. A cumulative frequency
column may also be formed for a grouped distribution.
The above example gives us the number of items “less than” a certain amount, but we may wish
to know, for example, the number of persons having more than some quantity. This can easily
be done by doing the cumulative additions from the bottom of the table instead of the top, and as
an exercise you should now compile the “more than” cumulative frequency column in the above
example.
87
Relative Frequency Distributions
All the frequency distributions which we have looked at so far in this study unit have had their
class frequencies expressed simply as numbers of items. However, remember that proportions or
percentages are useful secondary statistics. When the frequency in each class of a frequency
distribution is given as a proportion or percentage of the total frequency, the result is known as a
“relative frequency distribution” and the separate proportions or percentages are the “relative
frequencies”.
The total relative frequency is, of course, always 1.0 (or 100%). Cumulative relative frequency
distributions may be compiled in the same way as ordinary cumulative frequency distributions.
As an example, the distribution used in Table 6.14 is now set out as a relative frequency
distribution for you to study.
This example is in the “less than” form, and you should now compile the “more than” form in
the same way as you did for the non-relative distribution.
5.0 Introduction
This chapter covers further methods of summarising and presenting data to provide useful
information. Diagrams are particularly effective as a means of conveying quite complex
88
information. Presenting information visually is easy to understand and enables broad
distributions and trends to be taken in quickly.
However, they are rarely as accurate as the figures themselves and details are likely to be lost. In
this first part of the unit specific graphical forms of presenting frequency distributions of the type
considered in the previous chapter will be examined. A number of common graphical
presentations that are designed more for the lay reader than someone with statistical knowledge
will also be discussed.
5.1 Objectives
After completing this study unit students should be able to:
use frequency frequency bar charts, pictograms, pie charts to represent frequency
distributions;
prepare pictograms, pie charts and bar charts from given data, and discuss the
circumstances in which each form of presentation is most effective;
describe the general rules and principles for using graphical representations of data.
5.2 Pictograms
This is the simplest method of presenting information visually. These diagrams are variously
called “pictograms”, “ideograms”, or “picturegrams” – the words all refer to the same thing.
Their use is confined to the simplified presentation of statistical data for the general public.
Pictograms consist of simple pictures which represent quantities. There are two types and these
are illustrated in the following examples. The data we will use is shown in Table 5.1
Table 5.1: Cruises organised by a shipping line between Year 1 and Year 3
89
passengers carried
1 200 100 000
2 250 140 000
3 300 180 000
Although these diagrams show that the number of cruises has increased each year, they can give
false impressions of the actual increases. The reader can become confused as to whether the
quantity is represented by the length or height of the pictograms, their area on the paper, or the
volume of the object they represent. It is difficult to judge what increase has taken place.
(Sometimes you will even find pictograms in which the sizes shown are actually wrong in
relation to the real increases.)
5.2.2: Accurate Form
If the purpose of the pictogram is to convey anything but the most minimal of information, to
avoid confusion, it is better to try and give it a degree of accuracy in representing the numerical
90
data. This is usually through representing specific quantities of the variable with pictures or
graphics of some sort – usually related to the subject of the variable.
So, the number of passengers could be represented by using pictures of people as in
Figure 5.2
Figure 5.2: Number of passengers carried Years 1-3
The figures are taken from the Annual Abstract of Statistics as shown in Table 5.2
Table 5.2: Gas sales in Great Britain in One Year
Uses Million Therms %
91
Domestic 1 383 51
Industrial 843 31
Commercial 437 16
Public* 60 2
Total 2 723 100
(Source: Annual Abstract of Statistics, Great Britain)
* Central and local government uses, including public lighting
The figures are illustrated as the pie chart or circular diagram in Figure 5.3
2
16
Domestic
Industrial
51 Commercial
Public*
31
92
Label the diagram clearly, using a separate “legend” or “key” if necessary. (A key is
illustrated in Figure 5.3) You can insert the exact figures for the each segment if more
detail is required, since it is not possible to read this exactly from the diagram itself.
If you have the choice, don’t use a diagram of this kind with more than four or five
component parts.
The main use of a pie chart is to show the relationship each component part bears to the
whole. They are sometimes used side by side to provide comparisons, but this is not
really to be recommended, unless the whole diagram in each case represents exactly the
same total amount, as other diagrams (such as bar charts, which we discuss next) are
much clearer.
Figure 5.4: Component bar chart showing gas sales in Great Britain
93
900 843
800
700
600
500 437 Million Therms
400 %
300
200
100 51 60
31 16
0 2
0
Domestic Industrial Commercial Public*
Note that the lengths of the components represent the amounts, and that the components are
drawn in the same order so as to facilitate comparison. These bar charts are preferable to circular
diagrams because:
They are easily read, even when there are many components.
They are more easily drawn.
It is easier to compare several bars side by side than several circles.
Bar charts with vertical bars are sometimes called “column charts” to distinguish them from
those in which the bars are horizontal (see Figure 7.14).
Figure 5.5: Horizontal bar chart of visitors arriving in the UK in one year
European Common Wealth USA Others
51% 23% 24% 4%
This figure is also an example of a percentage component bar chart, i.e. the information is
expressed in percentages rather than in actual numbers of visitors.
When comparing several percentage component bar charts, each bar chart will be the same
length, as they each represent 100%, but they will not necessarily represent the same actual
94
quantities – for example, 50% might have been 1 million, whereas in another year it may have
been nearer to 4 million and in another to 8 million.
5.4.2 Horizontal Bar Chart
A typical case of presentation by a horizontal bar chart is shown in Figure 5.6 Note how a loss is
shown by drawing the bar on the other side of the zero line.
Figure 5.6: Horizontal bar chart for the So and So Company Ltd to show profits made by
branches in Year 1 and Year 2
Pie charts and bar charts are especially useful for “categorical” variables as well as for numerical
variables. The example in Figure 5.6 shows a categorical variable – i.e. the different branches
form the different categories, Figure 5.6 is also an example of a multiple or compound bar chart
as there is more than one bar for each category.
95
The axes of graphs must be clearly labelled, and the scales of values clearly marked.
Diagrams should be accompanied by the original data, or at least by a reference to the
source of the data.
Avoid excessive detail, as this defeats the object of diagrams.
Wherever necessary, guidelines should be inserted to facilitate reading.
Try to include the origins of scales. Obeying this rule sometimes leads to rather a waste
of paper space.
6.0 Introduction
This chapter mainly focuses on one of the objects of statistical analysis which is to condense
unwieldy data so as to make it more readily understood. The drawing of frequency curves
96
enables important general statement concerning statistics to be made. This can be taken further to
calculate some figure which will serve to indicate the general level of the variable under
discussion.
The single value which can be used to describe the general level of the variate is called a
“measure of location” or a “measure of central tendency”. There are three such measures with
which are going to be discussed in detail:
The arithmetic mean
The mode
The median.
6.1 Objectives
By the end of this chapter students should be able to:
97
C 64
D 71
E 71
F 66
G 74
Total 483
The arithmetic mean of these heights is 483 ÷ 7 = 69 inches. Notice that some values occur more
than once, but we still add them all. At this point we must introduce a little algebra. We don’t
always want to specify what particular items we are discussing (heights, egg weights, wages,
etc.) and so, for general discussion, we use, as you will recall from algebra, some general letter,
usually x. Also, we indicate the sum of a number of x’s by ∑ (sigma).
Thus, in our example, we may write:
∑x = 483
We shall be using “sigma notation” extensively in this unit, so make sure you understand the
meaning of the expression. We indicate the arithmetic mean by the symbol x (called “ x bar”) or
mew (called µ) and the number of items by the letter n. The calculation of the arithmetic mean
can be described by formula thus:
∑x 1
µ(mean) = or µ= ∑ x
n n
The latter expression is customary in statistical work. Applying it to the example above, we
have:
1
µ ( mean )= ( 483 )=¿69 inches
7
Arithmetic mean is often simply referred to as “the mean” when there is no chance of confusion
with other means (which we are not concerned with here).
6.1.1The Mean of a Simple Frequency Distribution
When there are many items (i.e. when n is large), the arithmetic can be eased somewhat by
forming a frequency distribution. Consider the following example.
98
Example 1
Table 6.2: Height distribution data
Indicating the frequency of each value by the letter f, you can see that ∑f = n and that, when the
x’s are not all the separate values but only the different ones, the formula becomes:
∑ fx 483
Mean = which is = 69 inches as before.
∑f 7
Of course, with only seven items it would not be necessary, in practice, to use this method, but if
we had a much larger number of items the method would save a lot of additions.
Table 6.3: Data for calculation of arithmetic mean of a grouped frequency distribution
99
∑ fx 1
Mean = = × 1 660= 33,2
∑f 50
Provided that ∑f is not less than about 50 and that the number of groups is not less than about 12,
the arithmetic mean thus calculated is sufficiently accurate for all practical purposes.
There is, though, one pitfall to be avoided when using this method – if all the groups should not
have the same class interval, be sure that you get the correct mid-values.
100
inclusion of his/her income in the arithmetic mean for the village might make the place
seem very much better off than it really was
The arithmetic mean is reasonably easy to calculate and to understand.
In more advanced statistical work it has the advantage of being amenable to algebraic
manipulation.
0 27
1 39
2 30
3 20
4 7
Total 123
In this case the most frequently occurring value is 1 (it occurred 39 times) and so the mode of
this distribution is 1. Note that the mode, like the mean, is a value of the variate, x, not the
frequency of that value. A common error is to say that the mode of the above distribution is 39.
This is wrong. The mode is 1.
For comparison, calculate the arithmetic mean of the distribution – it works out at 1.52.
The mode is used in those cases where it is essential for the measure of location to be an actually
occurring value.
101
An example is the case of a survey carried out by a clothing store to determine what size of
garment to stock in the greatest quantity. The average size of garment in demand might turn out
to be, let us say, 9, 3724, which is not an actually occurring value and doesn’t help us to answer
our problem. However, the mode of the distribution obtained from the survey would be an actual
value (perhaps size 8) and it would provide the answer to the problem.
All we can really say is that “60 – 70” is the modal group (the group with the largest frequency).
Table 6.5: Data for determining the mode of a grouped frequency
distribution
Group Frequency
0 – 10 6
10 – 20 10
20 – 30 16
30 – 40 24
40 – 50 32
50 – 60 38
60 – 70 40
70 – 80 20
80 – 90 10
90 – 100 5
100 – 110 1
110 – 120 6
You may be tempted to say that the mode is 65, but this is not true, nor even a useful
approximation in most cases. The reason is that the modal group depends on the method of
102
grouping, which can be chosen quite arbitrarily to suit our convenience. The distribution could
have been set out with class intervals of five instead of 10, and would then have appeared as
follows (only the middle part is shown, to illustrate the point):
103
6, 7, 7, 8, 8, 9, 10, 10, 10, 12, 13 The total of these eleven numbers is 100 and the arithmetic
100
mean is therefore = 9, 091, while the mode is 10 because that is the number which occurs
11
most often (three times). The median, however, is 9 because there are five values above and five
values below 9.
Our first rule for determining the median is, therefore, as follows:
Arrange all the values in order of magnitude and the median is then the middle value.
Note that all the values are to be used – even though some of them may be repeated, they must
all be put separately into the list. In the example just dealt with, it was easy to pick out the
middle value because there was an odd number of values. But what if there is an even number?
Then, by convention, the median is taken to be the arithmetic mean of the two values in the
middle. For example, take the following set of values:
6, 7, 7, 8, 8, 9, 10, 10, 11, 12
The two values in the middle are 8 and 9, so the median is 8½.
104
Table 6.6: Accident distribution data
It can now be seen from the last column that, if there were to be another list of all the separate
values in order, the first 27 would all be 0s and from then up to the 66th would be 1s. It follows,
therefore, that the 62nd item would be a 1 and that the median of this distribution is 1.
105
Table 6.7: Data for determining the median of a grouped frequency distribution
The total frequency is 206 and therefore the median is the 103rd item which, from the
cumulative frequency column, must lie in the 60 – 70 group. But exactly where in the 60-70
group? Well, there are 92 items before we get to that group and we need the 103rd item, so we
obviously need to move into that group by 11 items. Altogether in our 60 – 70 group there are 38
items, so we need to move 11/38 of the way into that group, that is 11/38 of 10 above 60. Our
median is therefore:
110
60 + = 60 + 2.89 = 62.89
38
106
It can be obtained without actually having all the values. If, for example, we want to
know the median height of a group of 21 men, we do not have to measure the height of
every single one; it is only necessary to stand the men in order of their heights and then
only the middle one (No.11) need be measured, for his height will be the median height.
The median is thus of value when we have open-ended classes at the edges of the
distribution as its calculation does not depend on the precise values of the variate in these
classes, whereas the value of the arithmetic mean does.
The median is not very amenable to further algebraic manipulation.
6.4 Range
This is the simplest measure of dispersion; it is simply the difference between the largest and the
smallest.
Table 6.8: Data for determining the range of a grouped frequency distribution
In the example just given, we can see that the lowest weekly output for Factory A was 90 and the
highest was 107; the range is therefore 17. For Factory B the range is 156 – 36 = 120. The larger
range for Factory B shows that it performs less consistently than Factory A.
The advantage of the range as a measure of the dispersion of a distribution is that it is very easy
to calculate and its meaning is easy to understand. For these reasons it is used a great deal in
industrial quality control work. Its disadvantage is that it is based on only two of the individual
values and takes no account of all those in between.
107
As a result, one or two extreme results can make it quite unrepresentative. Consequently, the
range is not much used except in the case just mentioned.
You will notice that we have added the relative cumulative frequency scale to the right of the
graph. 100% corresponds to 206, i.e. the total frequency. It is then easy to read off the values of
the variate corresponding to 25%, 50% and 75% of the cumulative frequency, giving the lower
quartile (Q1), the median and the upper quartile (Q3) respectively.
Q1 = 46.5 Median = 63 Q3 = 76
108
The difference between the two quartiles is the interquartile range and half of the difference is
the semi-interquartile range or quartile deviation:
Quartile deviation = Q3 – Q1
2
76−46.5
=
2
= 14.75
206
Alternatively, you can work out: 25% of the total frequency, i.e. =¿ 51.5, and
4
75% of the total frequency, i.e. 154.5
and then read from the ogive the values of the variate corresponding to 51.5 and 154.5 on the
cumulative frequency scale (i.e. the left-hand scale). The end result is the same.
109
Table 6.9: Output from factory a over 10 weeks
Variance = (xi – x )2
n
228.50
=
10
= 22.85
110
each result by its frequency.
This is shown in Table 6.10
Table 6.10: Determining the standard deviation of a simple frequency distribution
The standard deviation is then worked out in the same way as previously, as follows:
Standard Deviation(S.D) =
=
√ 228.5
10
= √ 22.85 (the variance)
= 4.78.
6.6.3 Characteristics of the Standard Deviation
In spite of the apparently complicated method of calculation, the standard deviation is the
measure of dispersion used in all but the very simplest of statistical studies. It is based on all of
the individual items, it gives slightly more emphasis to the larger deviations but does not ignore
the smaller ones and, most important, it can be treated mathematically in more advanced
statistics.
111
6.7 The Coefficient of Variation
Suppose that we are comparing the profits earned by two businesses. One of them may be a
fairly large business with average monthly profits of £50,000, while the other may be a small
firm with average monthly profits of only £2,000. Clearly, the general level of profits is very
different in the two cases, but what about the month-by-month variability? We can compare the
two firms as to their variability by calculating the two standard deviations.
Let us suppose that both standard deviations come to £500. Now, £500 is a much more
significant amount in relation to the small firm than it is in relation to the large firm so that,
although they have the same standard deviations, it would be unrealistic to say that the two
businesses are equally consistent in their month-to-month earnings of profits.
To overcome this difficulty, we can express the SD as a percentage of the mean in each case. We
call the result the “coefficient of variation”.
Applying the idea to the figures which we have just quoted, we get coefficients of variation
(usually indicated in formulae by V or CV) as follows:
500
For the large firm: V = ×100 = 1.0%
50000
500
For the small firm: V = ×100 = 25.0%
2000
This shows that, relatively speaking, the small firm is more erratic in its earnings than the large
firm.Note that although a standard deviation has the same units as the variate, the coefficient of
variation is a ratio and thus has no units.
The coefficient of variation is calculated by expressing the standard deviation as a percentage of
the mean:
S. D
Coefficient of variation (CV) = ×100
mean
By using the coefficient of variation you can compare dispersions of various distributions, such
as heights measured in cm with a distribution of weights measured in kg.
Another application of the coefficient of variation comes when we try to compare distributions
the data of which are in different units – for example, when we try to compare a French business
with an American business.
112
To avoid the trouble of converting the dollars to francs (or vice versa) we can calculate the
coefficients of variation in each case and thus obtain comparable measures of dispersion.
CHAPTER 7: PROBABILITY
7.0 Introduction
“Probability” is one of those ideas about which we all have some notion but which are not very
definite. Initially, we will not spend our time trying to get an exact definition, but will confine
ourselves to the task of grasping the idea generally and seeing how it can be used. Other words
which convey much the same idea are “chance” and “likelihood”. Just as there is a scale of, say,
temperature, because some things are hotter than others, so there is a scale of probability because
some things are probable than others. Snow is more likely to fall in winter than in summer; a
healthy person has more chance of surviving an attack of influenza than an unhealthy person.
There is, note, some uncertainty in these matters. Most things in real life are uncertain to some
degree or other, and it is for this reason that the theory of probability is of great practical value.
It is the branch of mathematics which deals specifically with matters of uncertainty. For the
purpose of learning the theory, it is necessary to start with simple things like coin-tossing and
dice-throwing, which may seem a bit remote from business and industrial life, but which will
help you understand the more practical applications.
7.1 Objectives
By the end of this chapter students should be able to:
Understand the basic concepts of probability and probability distributions, the basic
‘rules’ of probability, expected values and the use of probability and decision trees,
Types of events, Conditional probability, Tree diagrams and Estimate Probabilities
113
Probability is measured on a scale between 0 and 1. Any event which is impossible has a
probability of 0, and any event which is certain to occur has a probability of 1.
For example, the probability that the sun will not rise tomorrow is 0; the probability that a light
bulb will fail sooner or later is 1. For uncertain events, the probability of occurrence is
somewhere between 0 and 1. The 50-50 chance mentioned above is equivalent to a probability of
0.5.
Try to estimate probabilities for the following events. Remember that events which are more
likely to occur than not have probabilities which are greater than 0.5, and the more certain they
are the closer the probabilities are to 1.
Similarly, events which are more likely not to occur have probabilities which are less than 0.5.
The probabilities get closer to 0 as the events get more unlikely.
(a) The probability that a coin will fall heads when tossed.
(b) The probability that it will snow next Christmas.
(c) The probability that sales for your company will reach record levels next year.
(d) The probability that your car will not break down on your next journey.
(e) The probability that the throw of a die will show a six.
Sometimes probabilities can be specified by considering the physical aspects of the situation.
For example, consider the tossing of a coin. What is the probability that it will fall heads? There
are two sides to a coin. There is no reason to favour either side as a coin is symmetrical.
114
Therefore, the probability of heads, which we call P(H) is:
P(H) = 0.5
Another example is throwing a die. A die has six sides. Again, assuming it is not weighted in
favour of any of the sides, there is no reason to favour one side rather than another. Therefore,
the probability of a six showing uppermost, P (6), is:
1
P (6) = = 0.167
6
As a third and final example, imagine a box containing 100 beads of which 23 are black and 77
white.
If we pick one bead out of the box at random (blindfold and with the box well shaken up), what
is the probability that we will draw a black bead? We have 23 chances out of 100, so the
probability is:
23
P (B) = = 0.23
100
Probabilities of this kind, where we can assess them from our prior knowledge of the situation,
are also called “a priori” probabilities.
In general terms, we can say that if an event E can happen in h ways out of a total of n possible
equally likely ways, then the probability of that event occurring (called a success) is given by:
For example, what is the probability that an item on a production line will fail a quality control
test?
This question can be answered either by measuring the probability in a test situation (i.e.
empirically) or by relying on previous results. If 100 items are taken from the production line
115
and tested, then:
5
P(F) = = 0.05
100
11
Probability of passing, P (Pass) = = 0.92
12
Mutually exclusive
Non-mutually-exclusive
Independent
Dependent or non-independent
Complementary.
If two events are mutually exclusive then the occurrence of one event precludes the possibility of
the other occurring.
For example, the two sides of a coin are mutually exclusive since, on the throw of the coin,
“heads” automatically rules out the possibility of “tails”. On the throw of a die, a six excludes all
other possibilities. In fact, all the sides of a die are mutually exclusive; the occurrence of any one
of them as the top face automatically excludes any of the others.
116
These are events which can occur together. For example, in a pack of playing cards hearts and
queens are non-mutually-exclusive since there is one card, the queen of hearts, which is both a
heart and a queen and so satisfies both criteria for success.
These are events which are not mutually exclusive and where the occurrence of one event does
not affect the occurrence of the other. For example, the tossing of a coin in no way affects the
result of the next toss of the coin; each toss has an independent outcome.
These are situations where the outcome of one event is dependent on another event. For example,
the probability of a car owner being able to drive to work in his car is dependent on him being
able to start the car. The probability of him being able to drive to work given that the car starts is
a conditional probability. We can express this as:
An event either occurs or it does not occur, i.e. we are certain that one or other of these situations
holds. For example, if we throw a die and denote the event where a six is uppermost by A, and
the event where a one, two, three, four or five is uppermost by A (or not A), then A and A are
complementary, i.e. they are mutually exclusive with a total probability of 1.
This relationship between complementary events is useful as it is often easier to find the
probability of an event not occurring than to find the probability that it does occur. Using the
above formula, we can always find P(A) by subtracting P(A ) from 1.
117
P(A|B)
The events A and B must be causally linked in some way for A to be conditional upon B. For
example, we noted earlier the case where a car owner’s ability to drive to work in his car
(event A) is dependent on him being able to start the car (event B). There is self evidently a
causal link between the two events.
However, if we consider the case him being able to drive to work given that he is married, there
is no necessary link between the two events – and there is no conditional probability.
In a case involving equally likely outcomes for both events (i.e. both events A and B may or may
not occur), the conditional probability of event A occurring, given that B also occurs, is
calculated by the following formula:
n( A∧B)
P(A|B) = where n = the number of possible outcomes.
n(B)
One approach to determining probability in such circumstances is to construct a table of the
possible outcomes – sometimes called a contingency table.
Consider the case of 100 candidates attempting their driving test for the first time. 80 of the
candidates pass and 50 of them have had 10 or more lessons. Of those who have had 10 or more
lessons, 45 of them passed. What is the probability of passing at the first attempt if you have had
only 6 lessons?
We can construct a table which identifies the possible outcomes as follows:
118
possibility.
We have listed the possible outcomes of event B along the side – these are either to have
had 10 or more lessons, or less than 10 lessons. Thus, the rows will show the number of
outcomes for either possibility.
We have entered in the information that we know – that there are a total of 100 possible
outcomes, that the total number of outcomes in which event A was “passed” is 80, that
the total number of outcomes in which event B is “had 10 or more lessons” is 50, and that
the total number of outcomes in which event A was “passed” and event B was “had 10 or
more lessons” is 45.
We can complete the rest of the table by subtraction as follows:
Table 7.2: Contingency Outcomes Table
We can now work out the total outcomes applicable to the question “what is the probability of
passing at the first attempt if you have had only 6 lessons?”.
Remember that event A is “passed” and event B is “having had less than 10 lessons”. Therefore:
The total number of outcomes of B = 50
The total number of outcomes of both A and B = 35
Thus the probability is:
n( A∧B) 35
P(A|B) = = = 0.7
n(B) 50
We could consider the question “what is the probability of failing at the first attempt if you have
had more than 10 lessons?”. Here event A is “failed” and event B is “having had more than 10
lessons”.
The calculation is:
n( A∧B) 5
P(A|B) = = =¿ 0.1
n(B) 50
See if you can work out the answer to the following questions using the above table:
What is the probability of failing at the first attempt if you have only had 8 lessons?
119
Remember to be clear about which is event A and event B, then read off the total number of
outcomes for event B and for event A and B together, and then apply the formula.
Event A is “failed” and event B is “having less than 10 lessons”. The calculation then is:
n( A∧B) 15
P(A|B) = = = 0.3
n(B) 50
In the following examples, we shall consider the implications of conditional probability – the
probability of one event occurring if another event has also occurred.
Example 1
The concept can be illustrated using the example of a bag containing five red and three white
billiard balls. If two are selected at random without replacement, what is the probability that one
of each colour is drawn?
Let: R indicate a red ball being selected
W indicate a white ball being selected
120
When the second ball is selected, there are only seven balls left and the mix of red and white
balls will be determined by the selection of the first ball – i.e. the probability of a red or a white
ball being selected is conditional upon the previous event.
Thus:
If the first ball selected was red, there will be four red and three white balls left. The probabilities
for the selection of the second ball are then:
4 3
P(R) = and P(W) =
7 7
If the first ball selected was white, there will be five red and two white balls left. The
probabilities for the selection of the second ball are then:
5 2
P(R) = and P(W) =
7 7
This process of working out the different conditional probabilities can be very complicated and it
is easier to represent the situation by means of a tree diagram as in Figure 11.1. The probabilities
at each stage are shown alongside the branches of the tree.
121
the branch the probability of the outcome of this simple experiment being along that
branch.
We then consider drawing a second ball from the bag. Whether we draw a red or a white
ball the first time, we can still draw a red or a white ball the second time, so we mark in
the two possibilities at the end of each of the two branches of our existing tree diagram.
We enter on these second branches the conditional probabilities associated with them.
Thus, on the uppermost branch in the diagram we must insert the probability of obtaining
a second red ball given that the first was red.
Each complete branch from start to tip represents one possible outcome of the compound
experiment and each of the branches is mutually exclusive. We can see that there are four
different mutually exclusive outcomes possible, namely RR, RW, WR and WW.
122
Example 2
A bag contains three red balls, two white balls and one blue ball. Two balls are drawn at random
(without replacement). Find the probability that:
(a) Both white balls are drawn.
(b) The blue ball is not drawn.
(c) A red then a white are drawn.
(d) A red and a white are drawn.
To solve this problem, let us build up a tree diagram.
Stage 1 will be to consider the situation when the first ball is drawn:
Figure 7.2: Example 2 – Tree diagram of first event
We have given the first ball drawn a subscript of 1 – for example, red first = R 1. We shall give
the second ball drawn a subscript of 2.
123
2
P(W1) =
6
1
(P(B1) =
6
The possibilities in stage 2 may now be mapped on the tree diagram. When the second ball is
drawn, the probability of it being red, white or blue is conditional on the outcome of the first
event. Thus, note there was only one blue ball in the bag, so if we picked a blue ball first then we
can have only a red or a white second ball. Also, whatever colour is chosen first, there are only
five balls left as we do not have replacement.
We can now list all the possible outcomes, with their associated probabilities.
124
Table 7.4: Example 2 – Probabilities of outcomes from compound (conditional) events
125
1 1 2
P(R1W2) + P(W1R2) = + =
5 5 5
126
CHAPTER 8: TIME SERIES
8.0 Introduction
Businesses and governments use statistical analysis of information collected at regular intervals
over extensive periods of time to plan future policies. For example, sales values or
unemployment levels recorded at yearly, quarterly or monthly intervals are examined in an
attempt to predict their future behaviour. Such sets of values observed at regular intervals over a
period of time are called time series.
The analysis of this data is a complex problem as many variable factors may influence the
changes which take place. The first step in analysing the data is to plot the observations on a
scattergram, with time evenly spaced across the x axis and measures of the dependent variable
forming the y axis. This can give us a good visual guide to the actual changes, but is very little
help in showing the component factors causing these changes or in predicting future movements
of the dependent variable.
Statisticians have, therefore, constructed a number of models to describe the behaviour of time
series. These assume that changes over time in the variable being studied are caused by the
variation of four main factors. This gives rise to a number of approaches to the analysis of the
inter-relationships between these factors.
8.1 Objectives
By the end of the topic students should be able to:
Demonstrate how time-series analysis can be used in business forecasting. This includes the use
of the additive and multiplicative models to ‘decompose’ time-series data, the calculation of
trends and cyclical and seasonal patterns, and simple forecasting.
127
Table 8.1: Days Lost Through Sickness at a Thermometer Factory
128
Every historigram will look similar to this, depending on the scale of the y axis, but a careful
study of the change of pattern over time will suggest which model should be used for analysis.
Figure 8.1: Scattergram of days lost through sickness (1996 – 1999)
There are four factors that influence the changes in a time series – trend, seasonal variations,
cyclical fluctuations, and irregular or random fluctuations.
8.2.2Trend
This is the change in general level over the whole time period and is often referred to as the
secular trend. You can see in Figure 10.1 that the trend is definitely upwards, in spite of the
obvious fluctuations from one quarter to the next.
A trend can thus be defined as a clear tendency for the time series data to travel in a particular
direction in spite of other large and small fluctuations.
An example of a linear trend is shown in Figure 8.2. There are numerous instances of a trend –
for example, the amount of money collected from taxpayers is always increasing and, therefore,
any time series describing government income from tax would show an upward trend.
129
Figure 8.2: Example of a linear trend line
If you can establish a pattern to the variation throughout the year, then this seasonal variation is
likely to be similar from one year to the next. Thus, it would be possible to allow for it when
estimating values of the variable in other parts of the time series. The usefulness of being able to
calculate seasonal variation is obvious as, for example, it allows ice-cream manufacturers to alter
their production schedules to meet these seasonal changes.
Figure 8.3 shows a typical seasonal variation that could apply to the examples above.
Figure 8.3: Example of seasonal variation
130
8.2.4 Cyclical Fluctuations
These are long-term, but fairly regular variations in the pattern of the variable. They are difficult
to observe unless you have access to data over an extensive period of time during which external
conditions have remained relatively constant. For example, it is well known in the textile trade
that there is a cycle of about three years, during which time demand varies from high to low.
This is similar to the phenomena known as the trade cycle which many economists say exists in
the trading pattern of most countries, but for which there is no generally accepted explanation. It
is also thought that weather patterns may be cyclical in nature over long periods.
Figure 8.4 shows how such a cyclical fluctuation would relate to an upward trend. In our
example on sickness, a cyclical fluctuation could be caused by, say, a two-year cycle for people
suffering from influenza.
131
As this type of fluctuation is difficult to determine, it is often considered with the final (fourth)
element, and the two together are called the residual variation.
132
Irregular fluctuations (I)
It is possible for the relationship between these factors and the time series to be expressed in a
number of ways through the use of different mathematical models. Here, we shall consider in
detail the additive model.
The additive model, as its name suggests, can be expressed by the equation:
Series = Trend + Seasonal variation + Cyclical fluctuations + Random fluctuations
i.e. Y=T+S+C+I
As we noted, the cyclical and random fluctuations are often put together and called the “residual”
(R), so we get:
Y=T+S+R
We shall only be concerned here with the trend, the other components being outside the scope of
our studies at this time.
8.3.1 Trend
This is the most important factor of a time series.
Before deciding on the method to be used in finding it, we must decide whether the conditions
that have influenced the series have remained stable over time.
For example, if you have to consider the production of some commodity and want to establish
the trend, you should first decide if there has` been any significant change in conditions affecting
the level of production, such as a sudden and considerable growth in the national economy. If
there has, you must consider breaking the time series into sections over which the conditions
have remained stable.
Having decided the time period you will analyse, you can use any one of the following methods
to find the trend. The basic idea behind most of these methods is to average out the three other
factors of variation so that you are left with the long-term trend.
133
(b) Semi-Averages Method
This is a simple method which involves very little arithmetic. The time period is divided into
equal parts, and the arithmetic means of the values of the dependent variable in each half are
calculated. These means are then plotted at the quarter and three-quarters position of the time
series. The line adjoining these two points represents the trend of the series. Note that this line
will pass through the overall mean of the values of the dependent variable.
To calculate the mean of the first half of the time series, we need to divide the total of the values
of the variable over that period by the number of individual time periods covered. In our example
of days lost through sickness, the total time period covered is five years. The midpoint of the
whole series is, then, mid-way between quarter 2 and quarter 3 of 1997.
Taking the data for this first half of the series, we get:
Table 8.2: Days lost
134
To calculate the mean of the second half of the period, we need to consider the data from the
midpoint onwards:
These values are plotted on the historigram in Figure 8.5. You will notice that 30.3 days, as it is
the mean for the first half, is plotted halfway between quarters 1 and 2 of 1996, and likewise 44.2
days is plotted halfway between quarters 3 and 4 of 1998. The trend line is then drawn between
these two points and it can be extrapolated beyond these points as shown by the dotted line.
If there is an odd number of observations in the time series, the middle observation is ignored
and the means of the observations on each side of it are calculated.
135
(c) Moving Averages Method
So far, the methods we have discussed for finding trends have resulted in a straight line, but the
actual trend may be a curve or a series of straight segments. The method of moving averages
gives a way of calculating and plotting on the historigram a trend point corresponding to each
observed point. These points are calculated by averaging a number of consecutive values of the
dependent variable so that variations in individual observations are reduced.
The number of consecutive values selected will depend on the length of the short-term or
seasonal variation shown on the historigram.
The method of calculating a set of moving averages is illustrated by the following simple
example. Consider a time series comprising the seven numbers 6, 4, 5, 1, 9, 5, 6. If we take the
number of time periods covered by the seasonal variations to be four (as in quarterly figures over
each year), then a “moving average of order four” is needed. The calculation then proceeds as
follows.
Step 1: Find the average of the first to fourth numbers in the series:
136
6+ 4+5+1
Average =4
4
Step 2: Find the average of the second to fifth numbers in the series:
4+ 5+1+9
Average = =¿ 4.75
4
Step 3: Find the average of the third to sixth numbers in the series:
5+1+9+5
Average = =¿ 5
4
Step 4: Find the average of the fourth to seventh numbers in the series:
1+ 9+5+6
Average = =¿5.25
4
(If the series was longer, we could continue taking the average of the each successive set of four
numbers in the series.) Hence, in our simple example, the moving averages of order 4 are 4, 4.75,
5, 5.25, and these could be plotted on the historigram.
We selected a moving average of order 4 here since the data was quarterly. If the data was
monthly, a moving average of order 12 would be needed, and for daily data, the order would be
7, and so on. Returning now to our earlier example of days lost through sickness, we can
calculate the trend values and plot them on Figure 8.6. A moving average of order 4 will again be
used since the data is quarterly.
Notice how the table of calculations is set out, with the numbers in columns (4) and (5) placed
midway between the four quarterly readings to which they relate. This is because we are
averaging over an even number of values, so the moving average would have to be plotted in this
position on the historigram and would not correspond to any particular quarter.
This makes it necessary to add column (6) which gives the mean of successive pairs of moving
averages and these numbers are the trend values plotted in Figure 8.6 against particular quarters.
(The values in column (6) are often called the centred moving averages.) The trend values are
given correct to one decimal place as this is the greatest level of accuracy justified by the
accuracy of the data.
If we were calculating a moving average with an odd number of values, it would not be
necessary to carry out this final stage as the moving averages would be centred on an actual
observation and so would be the trend values.
137
The table of calculations is as follows.
Table 8.4: Calculation table for moving averages method of determining trend
138
However, this method has three disadvantages:
The trend line cannot be found for the whole of the time series. As you can see from our
example, there are no trend values for quarters at the beginning and end of the series.
Problems can be encountered in deciding the order number, i.e. the period of fluctuation.
Unless the seasonal or cyclical movement is definite and clear cut, the moving method of
deriving the trend may yield a rather unsatisfactory line.
Since the trend is calculated as a simple arithmetic mean it can be unduly influenced by a
few extreme values.
8.4 Forecasting
The reason for isolating the trend within a time series is to be able to make a prediction of its
future values and thus estimate the movement of the time series.
In considering forecasting, there are two key assumptions which must be made at the outset.
139
Those conditions and factors which were apparent during the period over which the trend was
calculated must be assumed to be unchanged over the period for which the forecast is made. If
they do change, then the trend is likely to change with them, thus making any predictions
inaccurate. For example, forecasts of savings trends based on given interest rates will not be
correct if there is a sudden change either up or down in those rates.
The method involves extending the moving average trend line drawn on the historigram of the
time series. The trend line is extended by assuming that the gradient remains the same as that
calculated from the data. The further forward you extend it, the more unreliable becomes the
forecast.
Although fairly easy to calculate, this forecast must be treated with caution because it is based on
the value of the trend calculated for the last period for which definitive data is available – if this
happens to be an especially high or low value then it would influence the trend, and thus the
forecast, considerably.
140
REFERENCES
Beck, S. E. and Manuel K. (2007). Practical Research Methods for Librarians and Information
Professionals. New York: Neal-Schuman
Dunteman G.H and Moon-Ho R. (2009). Introduction to Generalized Linear Models. Thousand
Oaks, CA Sage Publications.
Finkelstein M. O (2009) Basic Concepts of Probability and Statistics in the Law. New York,
NY: Springer.
Glyn B. Carrol G and Wall S. (2002). Quantitative Methods for business and economics.
Prentice Hall.
Osborne J. W. (2008). Best practices in quantitative methods. Thousand Oaks, Calif: Sage
Publications.
141