Chapter 2 - Organization and Presentation of Data: Learning Outcomes
Chapter 2 - Organization and Presentation of Data: Learning Outcomes
Introduction
Learning Outcomes
At the end of the chapter, you are expected to:
1. organize data systematically;
2. construct a frequency distribution for a given set of data; and
3. choose and construct the appropriate graphs for particular type of data;
Data are usually collected in a raw format and thus the inherent information is difficult to
understand. Therefore, raw data need to be summarized, processed, and analyzed. However, no
matter how well manipulated, the information derived from the raw data should be presented in an
effective format, otherwise, it would be a great loss for both authors and readers.
Data are set of facts, and provide a partial picture of reality. Whether data are being collected
with a certain purpose or collected data are being utilized, questions regarding what information the
data are conveying, how the data can be used, and what must be done to include more useful
information must constantly be kept in mind.
Collected data must be organized in order to show significant characteristics. They can be
presented in three forms:
1. Textual, where data is presented in paragraph form - Text is the principal method for
explaining findings, outlining trends, and providing contextual information.
2. Tabular, where data is presented in rows and columns - A table is best suited for
representing individual information and represents both quantitative and qualitative
information.
3. Graphical, where data is presented in visual form - A graph is a very effective visual tool as
it displays data at a glance, facilitates comparison, and can reveal trends and relationships
within the data such as changes over time, frequency distribution, and correlation or
relative share of a whole.
Text, tables, and graphs for data and information presentation are very powerful
communication tools. They can make an article easy to understand, attract and sustain the interest of
readers, and efficiently present large amounts of complex information
nds* 2020-2021
1
Psychological Statistics
FREQUENCY TABLE
Data can be presented in various forms depending on the type of data collected. A frequency
distribution is a table showing how often each value (or set of values) of the variable in question
occurs in a data set. A frequency table is used to summarize categorical or numerical data.
Frequencies are also presented as relative frequencies, that is, the percentage of the total number in
the sample.
nds* 2020-2021
2
Psychological Statistics
GRAPHICAL METHODS
Frequency distributions and are usually illustrated graphically by plotting various types of graphs:
nds* 2020-2021
3
Psychological Statistics
Frequency polygon - This is constructed by plotting the class frequencies against the class marks and
connecting the successive points with a straight line. The heights of the points represent
the frequencies.
Ogive (oh-jive), sometimes called a cumulative frequency polygon, is a type of frequency polygon that
shows cumulative frequencies. An ogive graph plots cumulative frequency on the y-axis and class
boundaries along the x-axis.
FREQUENCY DISTRIBUTION
Grouped data are data formed by aggregating individual observations of a variable into
groups, so that a frequency distribution of these groups serves as a convenient means of summarizing
or analyzing the data.
The grouped frequency table is a statistic method to organize and simplify a large set of
data in to smaller "groups." When a data consists of hundreds of values, it is preferable to group
them in a smaller chunks to make it more understandable. When grouped frequency table is
created, scientists and statistician can observe interesting trends in the data.
nds* 2020-2021
4
Psychological Statistics
The main purpose of the grouped frequency table is to find out how often each value
occurred within each group of the entire data. The group frequency distribution is essentially a
table with two columns. The first column titled "class interval" represents all possible "grouping" of
the data and the second column titled "frequency" represents how frequent each value occurred
within each group.
Activity:
Consider the height, in centimeters, of 100 students in a Statistics class. The heights of
these students are presented below:
158 153 148 162 161 160 170 156 157 166
151 158 152 164 155 160 154 155 166 147
163 165 142 154 156 163 150 168 157 167
138 164 160 161 157 164 160 162 144 163
162 157 155 167 152 158 150 154 160 163
155 158 168 151 172 163 161 164 156 153
158 163 169 167 160 165 159 147 159 162
160 158 153 172 161 144 141 164 156 161
156 157 157 158 164 157 162 160 158 160
162 159 156 161 157 158 160 152 160 153
nds* 2020-2021
5
Psychological Statistics
These heights presented are raw data. Raw data is data collected which have not been
organized numerically. From the raw data, it is difficult for anyone to pick out the salient features of
the data. For this reason, it is essential for the investigator to arrange the collected data in some form
that will enable him to see immediately those features he wishes to determine. Hence, there is really
a need to organize raw data in a more logical order.
The data above can be organized by arranging them either from highest to lowest or vice
versa. Whichever arrangement we consider will give us an easier way of determining some features
about the data. Such organization is called an array.
greater than the lower boundary of a given class interval is called the cumulative frequency up to and
including that of the given class interval. If we obtain the cumulative frequency from the smallest
upper-class boundary up to the highest upper-class boundary, we have a “less than” cumulative
frequency distribution. If we obtain the cumulative frequency from the highest lower-class boundary
up to the smallest lower-class boundary, we have a “greater than” cumulative frequency distribution.
Sometimes, it is more convenient to interpret data in terms of percent. The frequencies in
the different class intervals maybe expressed as a percentage. Such is called relative frequency. The
relative frequency of a class is determined by dividing the frequency of the class by the total
frequency of all the class intervals. For example, in the above frequency distribution, 23/100 or 23%
of the whole group has a height ranging from 158 cm to 160 cm.
Assessment Task
References
Altares, Priscilla S., et. al. 2003. Elementary Statistics: A Modern Approach. Rex Book Store.
Manila, Philippines
Deauna, Melecio C. 1999. Elementary Statistics for Basic Education. Phoenix Publishing House,
Inc. QC. Philippines
nds* 2020-2021
7
Psychological Statistics
nds* 2020-2021
8