Statistics Notes
Statistics Notes
from the Department of Health and choose Elements are grouped into hierarchy of units and
his sample from this list. sampling is done successively.
SYSTEMATIC SAMPLING Example:
A list of all members of o Suppose we wish to study the expenditure
the population is necessary. patterns of households in the Province of
To determine the sample to Iloilo. We can select a sample of households
be taken from the for this study using multistage (three-
population, you can stage) sampling where the primary stage
systematically get from the units are the cities/municipalities, the
list of all those whose names second-stage units are the barangays, and
are assigned to odd numbers or all names with even the third-stage units are the households.
numbers, or you can get those whose names start o Your research objective is to evaluate online
with a vowel and get every nth name in the master spending patterns of households in the US
list. through online questionnaires. You can
Example: form your sample group comprising 120
o Suppose we wish to conduct a survey on the households in the following manner:
opinions of senior citizens on the
computerized registration system. We can
get the list of senior citizens from the Office
of the Senior Citizen Association. This will
serve as our sampling frame. Arrange the
names alphabetically and systematically
get from the list of all those whose names
are assigned to odd numbers or all names
with even numbers, or you can get those
whose names start with a vowel and get
every nth name in the master list.
STRATIFIED SAMPLING TYPES OF NON-PROBABILITY SAMPLING
Divide the total population into strata. Each stratus Four main techniques used for a non-probability sample:
is composed of a more or less homogeneous sub- Haphazard or Convenience
population group but they differ from stratum to Quota
stratus in the total population. Purposive or Judgemental
Snowball
HAPHAZARD OR CONVENIENCE SAMPLING
The sample consists of elements that are most
accessible or easiest to contact. This usually includes
friends, acquaintances, volunteers, and subjects who
are available and willing to participate at the time
of the study such as the person interviewed at
Example: random in a shopping center for a television
program.
Example:
o The adviser of a student organization is
conducting a research on study habits of
students in the university. To select a
sample, the adviser includes the members
of the student organization because it is
easy to reach them and get data from them.
The adviser did not make use of any
randomization mechanism in the selection
of the units in sample. Rather, convenience
CLUSTER SAMPLING (ONE STAGE SAMPLING) was the sole criterion for selection.
Population is grouped into clusters or small units o A group of social scientists is interested in
composed of population elements, and the number of studying the socioeconomic profile of
these population clusters are chosen by simple persons with Acquired Immune Deficiency
random sampling or by systematic sampling with Syndrome (AIDS). In most cases the
random start. subjects with the disease will not admit
that she or he is a carrier in an ordinary
interview. There is also no complete list of
persons with AIDS. We cannot ask
hospitals to give us a list of patients
afflicted with the disease since this
information is confidential.
o Thus, in conducting the survey, the
MULTISTAGE SAMPLING
researchers sought the assistance of doctors
Statistical Biology 5
MODE
Occurs most frequently.
Nominal average
MEASURES OF LOCATION May or may not exist
A Measure of Location summarizes a data set by
giving a “typical value” within the range of the data values
that describes its location relative to entire data set.
Some Common Measures:
Minimum, Maximum
Central Tendency
Percentiles, Deciles, Quartile
Properties of a Mode
MAXIMUM AND MINIMUM
Can be used for qualitative as well as quantitative
Minimum is the smallest value in the data set,
data may not be unique.
denoted as MIN.
Not affected by extreme values.
Maximum is the largest value in the data set,
Can be computed for ungrouped and grouped data.
denoted as MAX.
Types of Modes
Statistical Biology 7
Example:
Suppose LJ was told that relative to the other scores on a
certain test, his score was the 95th percentile.
This means that 95% of those who took the test had
scores less than or equal to LJ’s score, while 5% had
scores higher than LJ’s.
DECILES Some Properties of IQR
Divide an array into ten equal parts, each part Reduces the influence of extreme values.
having ten percent of the distribution of the data Not as easy to calculate as the Range.
values, denoted by Dj. VARIANCE
The 1st decile is the 10th percentile; the 2nd decile Important measure of variation.
is the 20th percentile….. Shows variation about the mean.
QUARTILES
Divide an array into four equal parts, each part
having 25% of the distribution of the data values,
denoted by Qj.
The 1st quartile is the 25th percentile; the 2nd STANDARD DEVIATION
quartile is the 50th percentile, also the median and Most important measure of variation.
the 3rd quartile is the 75th percentile. Square root of Variance
Has the same units as the original data.
MEASURES OF VARIATION
A measure of variation is a single value that is used
to describe the spread of the distribution.
o A measure of central tendency alone does
not uniquely describe a distribution.
TWO TYPES OF MEASURES OF DISPERSION
Absolute Measures of Dispersion:
Range
Inter-quartile Range
Variance
Standard Deviation
Relative Measure of Dispersion:
Coefficient of Variation
RANGE
Remarks on SD
Statistical Biology 8