M.
RAHAT HUSSAIN
Dadabhoy Institute of Higher Education
Hill Park Campus
[email protected]
Important statistical terms
Population:
a set which includes all
measurements of interest
to the researcher
(The collection of all
responses, measurements, or
counts that are of interest)
Sample:
A subset of the population
?Why sampling
Get information about large populations
Less costs
Less field time
More accuracy i.e. Can Do A Better Job of
Data Collection
When it’s impossible to study the whole
population
Definition of sampling terms
Sampling unit
Subject under observation on which *
information is collected
Example: children <5 years, hospital discharges, health
…events
Sampling fraction
Ratio between sample size and *
population size
Example: 100 out of 2000 (5%)
Definition of sampling terms
Sampling frame
List of all the sampling units from which sample *
is drawn
Lists: e.g. children < 5 years of age, households, health
…care units
Sampling scheme
* Method of selecting sampling units from
sampling frame
…Randomly, convenience sample
Types of sampling
Sampling Techniques
Nonprobability Probability
Sampling Techniques Sampling Techniques
Convenience Judgmental Quota Snowball
Sampling Sampling Sampling Sampling
Simple Random Systematic Stratified Cluster Other Sampling
Sampling Sampling Sampling Sampling Techniques
Types of sampling
Non-probability samples
Convenience Sampling
Judgmental Sampling
Quota Sampling
Snowball Sampling
Types of sampling
Probability samples
Simple Random Sampling
Systematic Sampling
Stratified Sampling
Cluster Sampling
Other Sampling Techniques
Non probability samples
Convenience sampling
Attempts to obtain a sample of convenient
elements. Often, respondents are selected
because they happen to be in the right place
.at the right time
use of students, and members of social organizations *
mall intercept interviews without qualifying the *
respondents
department stores using charge account lists *
people on the street” interviews“ *
Non probability samples
Judgemental sampling
Is a form of convenience sampling in which
the population elements are selected based
.on the judgment of the researcher
test markets *
purchase engineers selected in industrial *
marketing research
expert witnesses used in court *
Non probability samples
Snowball sampling
An initial group of respondents is selected,
usually at random.
Quota sampling
May be viewed as two-stage restricted judgmental
.sampling
The first stage consists of developing control *
.categories, or quotas, of population elements
In the second stage, sample elements are *
.selected based on convenience or judgment
Probability samples
Random sampling
Each subject has a known probability of being
selected
Allows application of statistical sampling
theory to results to:
Generalise
Test hypotheses
Methods used in probability
samples
Simple random sampling
Systematic sampling
Stratified sampling
Multi-stage sampling
Cluster sampling
Simple random sampling
Principle *
Equal chance/probability of drawing each unit -
Procedure *
Take sampling population -
Need listing of all sampling units -
Number all units -
Randomly draw units -
Simple random sampling
Simple random sampling
Advantages
Simple -
Sampling error easily measured -
Disadvantages
Need complete list of units -
Does not always achieve best -
representativeness
Units may be scattered and poorly -
accessible
Systematic sampling
Principle *
Select sampling units at regular intervals -
Procedure *
Arrange the units in some kind of sequence -
Divide total sampling population by the -
designated sample size (eg 1200/60=20)
Choose a random starting point (for 20, the -
starting point will be a random number
)between 1 and 20
Select units at regular intervals (in this -
)case, every 20th unit
Systematic sampling
Systematic sampling
Advantages
Ensures representativity across list -
Easy to implement -
Disadvantages
Need complete list of units -
Periodicity-underlying pattern may be a problem -
)characteristics occurring at regular intervals(
Stratified sampling
When to use *
Population with distinct subgroups -
Procedure *
Divide (stratify) sampling frame into -
.homogeneous subgroups (strata) e.g
.age-group, urban/rural areas, regions
Draw random sample in each stratum -
If strata population size unequal: sample
same proportion of subjects from each
stratum (the same sampling fraction is used, so
probability proportional to size)
Stratified sampling
Selecting a sample with probability proportional to size
Area Population Proportion Sample size Sampling
size fraction
Urban 7000 70% 1000 x 0.7 = 700 10 %
Rural 3000 30% 1000 x 0.3 = 300 10 %
Total 10000 1000
Stratified sampling
Advantages
Can acquire information about whole population and -
individual strata
Precision increased if variability within strata is smaller -
(homogenous) than between strata
Disadvantages
Sampling error is difficult to measure -
Different strata can be difficult to identify -
Loss of precision if small numbers in individual strata -
(resolved by sampling proportional to stratum
population)
Cluster sampling
Principle
Whole population divided into groups e.g. -
neighbourhoods
A type of multi-stage sampling where all units at -
the lower level are included in the sample
Random sample taken of these groups (“clusters”) -
Within selected clusters, all units e.g. households -
included (or random sample of these units)
Provides logistical advantage -
Cluster sampling
Section 1 Section 2
Section 3
Section 5
Section 4
Cluster sampling
Advantages
Simple as complete list of sampling units within -
population not required
Less travel/resources required -
Disadvantages
Cluster members may be more alike than those -
in another cluster
and.…)homogenous(
this “dependence” needs to be taken into -
account in the sample size and in the analysis
(“design effect”)
Multiple stage sampling
:Principle
consecutive sampling -
example :
sampling unit = household
1st stage: draw neighborhoods
2nd stage: draw buildings
3rd stage: draw households
Multiple stage sampling
Selecting first-stage units with probability proportional to size
Scenario
Village Frequency Cumulative
- 7 villages in region
- 3 villages to be sampled individuals frequency
(first stage units) 1 30 30
2 400 430
- Total: 6000 individuals
3 1100 1530
- List cumulative 4 500 2030
frequency of all individuals 5 2000 4030
- draw 3 random numbers 6 100 4130
between 1 and 6000 7 1600 6000
e.g. 985, 3830, 4457
Conclusions
Probability samples are the best
Ensure
Representativeness
Precision
…..within available constraints