Lecture - 7.ppt Scaling Techniques
Lecture - 7.ppt Scaling Techniques
Fundamentals and
Comparative Scaling
3-2
3-3
3-4
Comparative Non-comparative
Scales Scales
Semantic Stapel
Likert
Differential
Figure 8.1 Primary Scales of Measurement
Scale
Nominal Numbers Finish
Assigned 7 8 3
to Runners
Interval Performance
Rating on a 8.2 9.1 9.6
0 to 10 Scale
15.2 14.1 13.4
Ratio Time to
Finish, in
Table 8.1 Primary Scales of Measurement
Scale Basic Common Marketing Permissible Statistics
Characteristics Examples Examples Descriptive Inferential
Nominal Numbers identify Social Brand nos., Percentages, Chi-square,
& classify objects Security nos., store types mode binomial test
numbering of
football
players
Ordinal Nos. indicate the Quality Preference Percentile, Rank-order
relative positions rankings, rankings, median correlation,
of objects but not rankings of market Friedman
the magnitude of teams in a position, ANOVA
differences tournament social class
between them
Interval Differences Temperature Attitudes, Range, mean, Product-
between objects (Fahrenheit, opinions, standard moment
can be compared, Celsius) index nos. deviation correlation,
zero point is t tests,
arbitrary regression
A paired comparison
taste test
Figure 8.4 Preference for Toothpaste Brands
Using Rank Order Scaling
Instructions: Rank the various brands of toothpaste in order of
preference. Begin by picking out the one brand that you like most
and assign it a number 1. Then find the second most preferred
brand and assign it a number 2. Continue this procedure until you
have ranked all the brands of toothpaste in order of preference.
The least preferred brand should be assigned a rank of 10.
No two brands should receive the same rank number.
The criterion of preference is entirely up to you. There is no right
or wrong answer. Just try to be consistent.
Figure 8.4 Contd.
Instructions
On the next slide are eight attributes of bathing soaps.
Please allocate 100 points among the attributes so that
your allocation reflects the relative importance you
attach to each attribute. The more points an attribute
receives, the more important the attribute is. If an
attribute is not at all important, assign it zero points. If
an attribute is twice as important as some other
attribute, it should receive twice as many points.
Figure 8.5 Contd.
Form
Average Responses of Three Segments
Attribute Segment
I Segment II Segment III
8 2 4
1. Mildness 2 4 17
2. Lather 3 9 7
3. Shrinkage 53 17 9
4. Price 9 0 19
5. Fragrance 7 5 9
6. Packaging 5 3 20
7. Moisturizing 13 60 15
8. Cleaning Power
Sum
100 100 100
3-14
Version 2
Probably the worst - - - - - - -I - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -- - Probably the best
0 10 20 30 40 50 60 70 80 90
100
Version 3
Very bad Neither good Very good
nor bad
Probably the worst - - - - - - -I - - - - - - - - - - - - - - - - - - - - - -- - - - - - - - - - - - - - - - -Probably the best
0 10 20 30 40 50 60 70 80 90
100
3-16
Likert Scale
The Likert scale requires the respondents to indicate a degree of agreement or
disagreement with each of a series of statements about the stimulus objects.
disagree
SEARS IS:
Powerful --:--:--:--:-X-:--:--: Weak
Unreliable --:--:--:--:--:-X-:--: Reliable
Modern --:--:--:--:--:--:-X-: Old-fashioned
Stapel Scale
The Stapel scale is a unipolar rating scale with ten categories
numbered from -5 to +5, without a neutral point (zero). This scale
is usually presented vertically.
SEARS
+5 +5
+4 +4
+3 +3
+2 +2X
+1 +1
HIGH QUALITY POOR SERVICE
-1 -1
-2 -2
-3 -3
-4X -4
-5 -5
Scale Evaluation
Figure 9.5
Scale Evaluation
Measurement Accuracy
The true score model provides a framework for
understanding the accuracy of measurement.
XO = XT + XS + XR
where
Reliability
Reliability can be defined as the extent to which
measures are free from random error, XR. If XR = 0,
the measure is perfectly reliable.
In test-retest reliability, respondents are
administered identical sets of scale items at two
different times and the degree of similarity between
the two measurements is determined.
In alternative-forms reliability, two equivalent
forms of the scale are constructed and the same
respondents are measured at two different times,
with a different form being used each time.
3-24
Reliability
Internal consistency reliability determines the
extent to which different parts of a summated scale
are consistent in what they indicate about the
characteristic being measured.
In split-half reliability, the items on the scale are
divided into two halves and the resulting half scores
are correlated.
The coefficient alpha, or Cronbach's alpha, is the
average of all possible split-half coefficients resulting
from different ways of splitting the scale items. This
coefficient varies from 0 to 1, and a value of 0.6 or
less generally indicates unsatisfactory internal
consistency reliability.
3-25
Validity
The validity of a scale may be defined as the extent
to which differences in observed scale scores reflect
true differences among objects on the characteristic
being measured, rather than systematic or random
error. Perfect validity requires that there be no
measurement error (XO = XT, XR = 0, XS = 0).
Content validity is a subjective but systematic
evaluation of how well the content of a scale
represents the measurement task at hand.
Criterion validity reflects whether a scale performs
as expected in relation to other variables selected
(criterion variables) as meaningful criteria.
3-26
Validity
Construct validity addresses the question of what
construct or characteristic the scale is, in fact,
measuring. Construct validity includes convergent,
discriminant, and nomological validity.
Convergent validity is the extent to which the
scale correlates positively with other measures of the
same construct.
Discriminant validity is the extent to which a
measure does not correlate with other constructs
from which it is supposed to differ.
Nomological validity is the extent to which the
scale correlates in theoretically predicted ways with
measures of different but related constructs.
3-27