chapter2
chapter2
03/25/2025
i. planning
Identify source and elements of the data.
Decide whether to consider sample or census.
If sampling is preferred, decide on sample size,
selection method,… etc
Decide measurement procedure.
Set up the necessary organizational structure
03/25/2025
ii. Measuring: there are different options.
Focus Group
Telephone Interview
Mail Questionnaires
Door-to-Door Survey
Mall Intercept
New Product Registration
Personal Interview and
Experiments are some of the sources for collecting the
primary
03/25/2025
data.
2. Secondary Data
Note: Data which are primary for one may be secondary for the other.
METHODS OF DATA PRESNTATION
Definitions:
Raw data: recorded information in its original collected form,
03/25/2025
whether it be counts or measurements, is referred to as raw data.
Frequency: is the number of values in a specific class
of the distribution.
Frequency distribution: is the organization of raw
data in table form using classes and frequencies.
Example: student age distribution
03/25/2025
Why Use Frequency Distributions?
The reasons for constructing a frequency distribution are as follows:
To organize the data in a meaningful, intelligible way.
To enable the reader to determine the nature or shape of the
distribution
To facilitate computational procedures for measures of average and
spread
To enable the researcher to draw charts and graphs for the
presentation of data
To enable the reader to make comparisons between different data
set
Types of Frequency Distribution
1. Categorical
2. Ungrouped 3. Grouped
03/25/2025
1. categorical frequency distribution
• Used for data that can be place in specific categories
such as nominal, or ordinal. e.g. marital status.
Example: a social worker collected the following data
on marital status for 25 persons.(M=married, S=single,
W=widowed, D=divorced) construct categorical
frequency distribution
03/25/2025
2. Ungrouped frequency distribution
- Is a table of all potential values that could possibly occur
in the data collection along with their corresponding
frequencies.
Example: Consider age of 20 students who read in library last night
30, 41, 39, 41, 32, 29, 35, 31, 30, 36, 33, 36, 32, 42, 30, 35, 37,
32,30, and 41.
03/25/2025
3. Grouped Frequency distribution
When the range of the data set is large, the data must be grouped in to
classes that are more than one unit in width.
03/25/2025
Definitions:
Grouped Frequency Distribution: a frequency distribution when several numbers are grouped
in one class.
Class limits: Separates one class in a grouped frequency distribution from another.
The limits could actually appear in the data and have gaps between the upper limits of one
class and lower limit of the next.
Units of measurement (U): the distance between two possible consecutive measures. It is
usually taken as 1, 0.1, 0.01, 0.001, -----.
Class boundaries: Separates one class in a grouped frequency distribution from another.
The boundaries have one more decimal places than the row data and therefore do not
appear in the data.
Class width: the difference between the upper and lower class boundaries of any class.
It is also the difference between the lower limits of any two consecutive classes or the
difference between any two consecutive class marks.
Class mark (Mid points): it is the average of the lower and upper class limits or the average of
upper and lower class boundary.
Cumulative frequency: is the number of observations less than/more than or equal to a specific
value
03/25/2025
• Cumulative frequency above: it is the total frequency of all values
It can be more than or less than type, depending on the type of cumulative
frequency used.
2. The classes must be mutually exclusive. This means that no data value can
3. The classes must be all inclusive or exhaustive. This means that all data
distribution.
5. The classes must be equal in width. The exception here is the first or last
class. It is possible to have an "below ..." or "... and above" class. This is
11 29 6 33 14 31 22 27 19 20
18 17 22 38 23 21 26 34 39 27
6 11 14 17 18 19 20 21 22 22 23 26 27 27 29 31 33 34 38 39
lutions:
Step 1: Arrange the data in ascending order.
Step 2: Find the range (R) : 𝑅 = 𝑀𝑎𝑥 − 𝑀𝑖𝑛 = 39 − 6 = 33.
Step 3: Select the number of classes desired using Sturges formula;
𝑘 = 1 + 3.322 𝑥 𝑙𝑜𝑔𝑛 =𝑘 = 1+ 3.322 𝑥 𝑙𝑜𝑔ሺ20ሻ = 5.32 ≈ 5 (𝑟𝑜𝑢𝑛𝑑𝑖𝑛𝑔 𝑑𝑜𝑤𝑛).
Step 4: Find the class width; 𝑤= 𝑘 = 𝑤= = 6.6 ≈ 7 ሺ𝑟𝑜𝑢𝑛𝑑𝑖𝑛𝑔 𝑢𝑝ሻ .
𝑅 33
5
Class limits
6 – 12
13 – 19
20 – 26
27 – 33
34 – 40
03/25/2025
Step 6: Find the class boundaries;
03/25/2025
The complete frequency distribution is given as follows
Class Class Class f Lcf Mcf rf. %rf %rcf
limit boundary Mark
03/25/2025
Diagrammatic and Graphic presentation of data
These are techniques for presenting data in visual displays using
geometric and pictures.
Importance:
They have greater attraction.
They facilitate comparison.
They are easily understandable.
Diagrams are appropriate for presenting discrete data.
The three most commonly used diagrammatic presentation
for discrete as well as qualitative data are:
1. Pie charts
2. Pictogram
3. Bar charts
03/25/2025
1. Pie charts
Boys Men
G ir ls Women
03/25/2025
2. Pictogram
03/25/2025
Bar Charts:
03/25/2025
There are different types of bar charts.
The most common are :
A 12 14 18
B 24 21 18
C 24 35 54
03/25/2025
a. Simple Bar chart
Sales by product in 1957
30
25
Sales in $
20
15
10
5
0
A B C
product
03/25/2025
2. Component Bar Chart
SALES BY PRODUCT 1957-1959
100
80
Sales in $
Product C
60
Product B
40
Product A
20
0
1957 1958 1959
Year of production
03/25/2025
3. Multiple Bar Chart
60
50
Sales in $
40 Product A
30 Product B
20 Product C
10
0
1957 1958 1959
Year of production
03/25/2025
Graphical Presentation of data
• The histogram, frequency polygon and cumulative frequency graph or ogive are
most commonly applied graphical representations for continuous data.
a. Histogram
Interval Frequency
(No gaps
between 0 10 20 30 40 50 60
bars) 70 Temperature in Degrees
03/25/2025
b. Frequency Polygon
– A line graph.
– The frequency is placed along the vertical axis and classes
mid points are placed along the horizontal axis.
– It is customer to the next higher and lower class interval
with corresponding frequency of zero, this is to make it a
complete polygon.
03/25/2025
03/25/2025
c. Ogive (cumulative frequency polygon)
• A graph showing the cumulative frequency (less
than or more than type) plotted against upper or
lower class boundaries respectively.
• Class boundaries along the horizontal axis
corresponding cumulative frequencies are along
the vertical axis. The points are joined by a free
hand curve.
03/25/2025
20
Less than Ogive
15
10