Assignment
2 PRESENTATION OF DATA
INTRODUCTION:
Before applying any statistical technique to raw data, we must arrange and classify the data
in a systematic form so that the statistical work becomes simple and easy. This is called presentation
of data.
The following four methods are usually used for the presentation of data:
(i) Classification (ii) Tabulation (iii) Diagrams (iv) Graphs
CLASSIFICATION:
The process of arranging data into classes or categories according to some common
characteristic present in the data is called classification.
The Basis of Classification:
There are four important bases for classification of data.
(i) Qualitative base (ii) Quantitative base
(iii) Geographical base (iv) Chronological base
(i) Qualitative Base:
The classification is called Qualitative when the data are classified by qualities or attributes such as
gender, marital status, employment status, religion, beauty etc.
(ii) Quantitative Base:
The classification is called Quantitative when the data are classified by quantitative characteristics such
as heights, age, weight, distance, length, income etc.
(iii) Geographical Base:
The classification is called Geographical when the data are classified by geographical regions or
locations. For example, the population of a country may be classified by provinces, divisions, districts, tehsils or
towns.
(iv) Chronological Base:
The classification is called Chronological when the data are arranged by successive time periods. For
example, the monthly sales of a department store, the yearly enrollment of students in M.A.O. College, or the hourly
temperature recorded by the weather bureau.
Types of Classification:
Some important types of classification are;
(i) One way classification. (ii) Two way classification.
(iii) Three way classification. (iv) Many way classification.
(i) One Way Classification:
When the data are classified by one characteristic, the classification is said to be one way.
For example, the population of a country may be classified by religion as Muslims, Christians and Sikhs.
(ii) Two Way Classification:
When the data are classified by two characteristics simultaneously (at a time), then classification is said
to be two way. For example, the students of Punjab University, Lahore may be classified by Age and Height.
(iii) Three Way Classification:
When the data are classified by three characteristics simultaneously, the classification is said to be
three way. For example, the population of the city of Lahore may be classified by Religion, Sex and Literacy.
(iv) Many Way Classification:
When the data are classified by many characteristics simultaneously, the classification is said to
be many way. For example, the population of the city of Lahore may be classified by Religion, Sex, Age, Height,
Literacy etc.
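The difference between one way and two way classification can be sketched in a few lines of Python. The records below are hypothetical and serve only as an illustration; they are not taken from any census.

```python
from collections import Counter

# Hypothetical records of (religion, sex); illustrative values only.
people = [
    ("Muslim", "Male"),
    ("Muslim", "Female"),
    ("Christian", "Male"),
    ("Sikh", "Female"),
    ("Muslim", "Male"),
]

# One way classification: count by a single characteristic (religion).
one_way = Counter(religion for religion, _sex in people)

# Two way classification: count by two characteristics simultaneously.
two_way = Counter(people)

print(one_way)
print(two_way)
```

Adding a third field to each record and counting the full tuples would give a three way classification in the same manner.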
TABULATION:
The process of systematically arranging data into rows and columns is called tabulation.
Classification is the first step of tabulation. Tabulation may be single, double or manifold, depending on the type of
classification.
Main Parts of Table and its Construction:
A statistical table has at least four major parts:
(i) The title (ii) The box head
(iii) The stub (iv) The body of table
In addition, some tables have other minor parts:
(v) Prefatory Note or Head Note (vi) Foot Note
(vii) Source Note
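                       TITLE
                  (Prefatory Notes)
+----------------+-----------------------------+
|                |          BOX HEAD           |
|                |      (Column Captions)      |
+----------------+-----------------------------+
|      STUB      |                             |
| (Row Captions) |            BODY             |
|                |                             |
+----------------+-----------------------------+
Foot note: ...
Source note: ...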
Example 2.1
Represent the data given in the following paragraph in the form of a table, so as to bring out clearly all
the facts, including the source, and give the table a suitable title:
“According to the census of Manufactures Report 1945, the John Smith Manufacturing Company
employed 400 non-union and 1250 union employees in 1941. Of these 220 were females of which 140 were
non-union. In 1942, the number of union employees increased to 1475 of which 1300 were males. Of the 250
non-union employees 200 were males. In 1943, 1700 employees were union members and 50 were non-union.
Of all the employees in 1943, 250 were females of which 240 were union members. In 1944, the total number
of employees was 2000 of which one percent was non-union. Of all the employees in 1944, 300 were females
of which only 5 were non-union.”
Solution:
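Title: DISTRIBUTION OF EMPLOYEES OF THE JOHN SMITH MANUFACTURING COMPANY
BY SEX AND MEMBERSHIP DURING 1941 TO 1944

      |         All          |        Union         |      Non-union
Year  | Total  Male  Female  | Total  Male  Female  | Total  Male  Female
1941  | 1650   1430  220     | 1250   1170  80      | 400    260   140
1942  | 1725   1500  225     | 1475   1300  175     | 250    200   50
1943  | 1750   1500  250     | 1700   1460  240     | 50     40    10
1944  | 2000   1700  300     | 1980   1685  295     | 20     15    5

Source: Census of Manufactures Report 1945.
Here the two caption rows (All, Union, Non-union and their Total, Male, Female columns) form the box head,
the Year column is the stub, and the figures form the body of the table.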
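The cells that the paragraph does not state directly follow by subtraction. The sketch below shows one way to derive every row from the figures given in Example 2.1; the function name `build_row` and the dictionary layout are illustrative choices, not part of the original solution.

```python
def build_row(union_male, union_female, non_union_male, non_union_female):
    """Return the nine body cells of one table row from the four basic counts."""
    union = union_male + union_female
    non_union = non_union_male + non_union_female
    male = union_male + non_union_male
    female = union_female + non_union_female
    return {
        "All": (male + female, male, female),
        "Union": (union, union_male, union_female),
        "Non-union": (non_union, non_union_male, non_union_female),
    }

# Basic counts worked out from the paragraph. For 1941, for example:
# 220 females, 140 of them non-union, so 220 - 140 = 80 union females.
rows = {
    1941: build_row(1250 - (220 - 140), 220 - 140, 400 - 140, 140),
    1942: build_row(1300, 1475 - 1300, 200, 250 - 200),
    1943: build_row(1700 - 240, 240, 50 - (250 - 240), 250 - 240),
    # 1944: one percent of 2000 employees is 20 non-union, so 1980 union.
    1944: build_row(1980 - (300 - 5), 300 - 5, 20 - 5, 5),
}

for year, row in rows.items():
    print(year, row["All"], row["Union"], row["Non-union"])
```

Each tuple is ordered as (Total, Male, Female), matching the column captions of the table.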
DIAGRAMS OR CHARTS:
A diagram is any one-, two- or three-dimensional form of graphical representation. The commonly used
diagrams or charts are:
(i) Simple Bar Chart (ii) Multiple Bar Chart (iii) Component Bar Chart or Sub-divided Bar Chart
(iv) Percentage Component Bar Chart (v) Rectangular Bar Chart (vi) Pie chart
Simple Bar Chart or Diagram:
A simple bar chart is used to represent data involving a single variable. Vertical or horizontal bars
are drawn to represent the data, usually when the differences between the quantities are small. The width of
the bars is always uniform and has no significance; the length of each bar is proportional to the size of the
quantity it represents. The space between the bars should be no more than the width of a bar and no less than
half of it. Vertical bars are used to represent time series or quantitative data, while horizontal bars are used to
represent qualitative or geographical data. Data that do not form a time series should be arranged in ascending
or descending order before the chart is drawn.
Example 2.10
The following table shows the production of wheat in Pakistan during the years 2001 to 2006. Represent
the data by a Simple Bar Chart.
Years 2001 2002 2003 2004 2005 2006
Production (Lakh tons) 64 68 73 75 71 81
Solution:
[Figure: SIMPLE BAR CHART. X-axis: Years (2001 to 2006); Y-axis: Production, scaled from 0 to 100 in steps of 20]
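The rule that bar length is proportional to the size of the quantity can be sketched as a small text chart. The function name `bar_lengths` and the maximum length of 40 characters are arbitrary illustrative choices, and the bars are printed horizontally only because that is easier in plain text.

```python
# Wheat production in lakh tons, from Example 2.10.
production = {2001: 64, 2002: 68, 2003: 73, 2004: 75, 2005: 71, 2006: 81}

def bar_lengths(data, max_length=40):
    """Scale each value to a bar length proportional to its size."""
    largest = max(data.values())
    return {key: round(value / largest * max_length) for key, value in data.items()}

lengths = bar_lengths(production)
for year, length in lengths.items():
    # Every bar has the same width (one text row); only the length varies.
    print(f"{year} | {'#' * length} {production[year]}")
```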
Pie Chart:
A pie chart has the same function as a sub-divided rectangular chart. The only difference is
that in a pie chart a circle is used instead of rectangles. A pie chart consists of a circle divided into
sectors, or pie-shaped pieces, whose areas are proportional to the various parts into which the whole
quantity is divided. The sectors are shaded differently to show the relationship of the parts to the whole. A pie
chart is also known as a Sector Diagram.
To construct a pie chart, draw a circle of any convenient radius. The whole quantity to be displayed
corresponds to 360°, because the total angle at the centre of a circle is 360°. The angle for each component is
therefore calculated, and these angles are used to show the different components. The angles are calculated by the following formula:
$$\text{Angle} = \frac{\text{Component part}}{\text{Whole quantity}} \times 360^\circ$$
Then divide the circle into sectors by constructing these angles at the centre with the help of a
protractor.
Example 2.11
The following table gives expenditures in rupees of a Family on different commodities or items.
Represent the data by a Pie Chart.
Items Expenditure in Rs.
Food 190
Clothing 64
Rent 100
Medical Care 46
Other items 80
Solution:
Items          Expenditure in Rs.   Angles of the Sectors
Food           190                  190/480 × 360° = 142.5°
Clothing       64                   64/480 × 360° = 48°
Rent           100                  100/480 × 360° = 75°
Medical Care   46                   46/480 × 360° = 34.5°
Other items    80                   80/480 × 360° = 60°
TOTAL          480                  360°
PIE CHART SHOWING EXPENDITURES IN RUPEES OF DIFFERENT COMMODITIES OF A
FAMILY
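The sector angles of Example 2.11 can be checked with a short Python sketch of the angle formula. The function name `sector_angles` is an illustrative choice.

```python
# Expenditure in rupees, from Example 2.11.
expenditure = {
    "Food": 190,
    "Clothing": 64,
    "Rent": 100,
    "Medical Care": 46,
    "Other items": 80,
}

def sector_angles(parts):
    """Angle of each sector in degrees: component part / whole quantity * 360."""
    whole = sum(parts.values())
    return {item: value * 360 / whole for item, value in parts.items()}

angles = sector_angles(expenditure)
for item, angle in angles.items():
    print(f"{item:<13} {angle:6.1f} degrees")
```

Because every angle is a share of the same whole, the angles always add up to 360°, which is a quick check on the arithmetic.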
GRAPHS:
Diagrams are not well suited to representing a statistical series spread over time, a frequency distribution, or two
related variables in visual form, so graphs are used for such representations.
A graph consists of a straight line or a curve and presents the data in a simple and effective manner.
Graphs are used to make comparisons between two or more statistical series. Sometimes graphs may
also be used to make predictions and forecasts.
Types of Graphs:
Graphs can be divided into two main categories as;
(a) Graph of time series (Historigram) (b) Graph of frequency distribution
Here we will only discuss the graph of frequency distribution.
GRAPHS OF FREQUENCY DISTRIBUTION:
The important graphs of frequency distributions are;
(i) Histogram (ii) Frequency Polygon (iii) Frequency Curve (iv) Cumulative frequency Curve or Ogive.
Histogram:
A histogram consists of a set of adjacent rectangles in which the class boundaries are marked along the X-axis
and the frequencies are taken along the Y-axis. When the class intervals are equal, the rectangles all have the same
width and their heights are directly proportional to the respective class frequencies. If the class
intervals are not equal, the heights of the rectangles must be adjusted so that the areas, rather than the heights,
are proportional to the frequencies. To adjust the heights, each class frequency is divided by its class interval size.
Example 2.12
Construct Histogram for the following frequency distribution.
Classes 10-14 15-19 20-24 25-29 30-34 35-39 40-44
f 4 12 25 30 25 15 6
Solution:
C–B 9.5-14.5 14.5-19.5 19.5-24.5 24.5-29.5 29.5-34.5 34.5-39.5 39.5-44.5
f 4 12 25 30 25 15 6
HISTOGRAM
Example 2.13
Construct Histogram for the following frequency distribution.
Solution:
C–I       frequency   C–B           Class Interval Size   Adjusted frequency
10 – 11   4           9.5 – 11.5    2                     4/2 = 2
12 – 14   12          11.5 – 14.5   3                     12/3 = 4
15 – 19   25          14.5 – 19.5   5                     25/5 = 5
20 – 29   60          19.5 – 29.5   10                    60/10 = 6
30 – 34   25          29.5 – 34.5   5                     25/5 = 5
35 – 39   15          34.5 – 39.5   5                     15/5 = 3
40 – 42   6           39.5 – 42.5   3                     6/3 = 2
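The adjustment for unequal class intervals can be sketched in Python as follows. The function name `adjusted_frequencies` is illustrative, and the `gap` parameter is an assumption: it is the distance between the upper limit of one class and the lower limit of the next, which is 1 for these data.

```python
# (lower class limit, upper class limit, frequency), from Example 2.13.
classes = [
    (10, 11, 4), (12, 14, 12), (15, 19, 25), (20, 29, 60),
    (30, 34, 25), (35, 39, 15), (40, 42, 6),
]

def adjusted_frequencies(rows, gap=1):
    """Return (lower boundary, upper boundary, size, adjusted frequency) per class."""
    result = []
    for lower, upper, freq in rows:
        lower_b = lower - gap / 2   # class boundaries close the gap between classes
        upper_b = upper + gap / 2
        size = upper_b - lower_b    # class interval size
        result.append((lower_b, upper_b, size, freq / size))
    return result

table = adjusted_frequencies(classes)
for lower_b, upper_b, size, height in table:
    print(f"{lower_b:5.1f} - {upper_b:5.1f}  size {size:4.1f}  height {height:4.1f}")
```

The adjusted frequency is the height of each rectangle, so multiplying it by the class interval size gives back the original class frequency as the area.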
Frequency Polygon:
A second useful way of presenting a frequency distribution in graphic form is the frequency polygon. A
frequency polygon is a line graph obtained by plotting the class frequencies against the class marks and then joining
consecutive points by straight lines. It can also be obtained by joining the midpoints of
the tops of the rectangles in the histogram. Drawn this way, the ends of the graph do not meet the X-axis. Because a polygon
is a many-sided closed figure, we add an extra class with zero frequency at each end of the frequency distribution,
which brings both ends down to the X-axis and completes the frequency polygon.
A frequency polygon is used instead of a histogram when two frequency distributions are to be
compared.
A frequency polygon gives a rough idea of the mode, skewness and kurtosis of the distribution.
Example 2.14
Draw a frequency polygon for the following frequency distribution.
Classes 60-62 63-65 66-68 69-71 72-74 75-77 78-80
f 4 9 14 18 12 7 3
Solution:
C–B 59.5-62.5 62.5-65.5 65.5-68.5 68.5-71.5 71.5-74.5 74.5-77.5 77.5-80.5
f 4 9 14 18 12 7 3
X 61 64 67 70 73 76 79
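[Figure: FREQUENCY POLYGON, class frequencies plotted against class marks]
Alternative Method
[Figure: FREQUENCY POLYGON, drawn by joining the midpoints of the tops of the histogram rectangles]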
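The points to be plotted for the frequency polygon of Example 2.14 can be computed as in the sketch below. It assumes equal class widths, so that the two extra zero-frequency classes lie one class width beyond the first and last class marks; the variable names are illustrative.

```python
# Class limits and frequencies, from Example 2.14.
limits = [(60, 62), (63, 65), (66, 68), (69, 71), (72, 74), (75, 77), (78, 80)]
freqs = [4, 9, 14, 18, 12, 7, 3]

# Class mark X = midpoint of the class limits.
marks = [(lower + upper) / 2 for lower, upper in limits]

# Extra classes with zero frequency at both ends close the polygon on the X-axis.
width = marks[1] - marks[0]
points = [(marks[0] - width, 0)] + list(zip(marks, freqs)) + [(marks[-1] + width, 0)]

for x, f in points:
    print(x, f)
```

Joining these points in order by straight lines gives the closed polygon; joining only the seven inner points would leave the ends above the X-axis.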
Frequency Curve:
If the frequency polygon is smoothed, the result is called a frequency curve. Equivalently, if the plotted
points of the frequency polygon are joined by a freehand curve instead of straight lines, we get
the frequency curve. A frequency curve should not touch the X-axis.
Example 2.15
Draw a frequency curve for the following frequency distribution.
Classes 60-62 63-65 66-68 69-71 72-74 75-77 78-80
f 4 9 14 18 12 7 3
Solution:
C–I 60-62 63-65 66-68 69-71 72-74 75-77 78-80
f 4 9 14 18 12 7 3
X 61 64 67 70 73 76 79
FREQUENCY CURVE