Statistics Notes
Statistics Notes
Understanding Data
Data Set – A collection of numbers or values. For example,
a teacher recording students’ test scores.
Elements – The individuals or objects being studied. In this
case, the students.
Observation – A recorded measurement for each element.
For example, each student’s test score.
Context Matters! – Data is meaningless without
understanding what it represents and who it applies to.
Knowing the background of the data helps in making sense
of patterns and insights.
Quantitative Variables
Quantitative variables represent measurable numerical
values that can be counted or measured.
Example: A person's height, weight, income, or test score.
These variables can be used for arithmetic operations
like addition, subtraction, and averaging.
Types of Quantitative Variables:
o Discrete Variables: Take on whole number values
(e.g., number of siblings, number of pets).
o Continuous Variables: Can take any value within a
given range (e.g., height, weight, temperature).
Examples of Quantitative Variables:
Variable Type Examples
Age Continu 5.2 years, 21 years,
ous 67.8 years
Height Continu 5'8", 6'2", 170 cm
ous
Number of Discrete 0, 1, 3, 5
Pets
Monthly Continu $2,500, $4,700.50,
Salary ous $6,100
Number of Discrete 3, 4, 5, 7
Classes
Level of Measurement
The level of measurement determines how data can be
analyzed and what mathematical operations make sense.
Level Definition Examples
Nomin Data is categorized but has no Gender, Religion, Eye
al meaningful order Color
Ordin Data is categorized with a Satisfaction Levels
al meaningful order, but (Poor, Fair, Good),
differences between values are Education Level
not meaningful
Interv Data is ordered, has Temperature (°C, °F),
al meaningful differences, but no IQ Scores
true zero
Ratio Data is ordered, has Age, Income, Weight
meaningful differences, and
has a true zero
Example Question
Transportation Safety
The chart shows the number of job-related injuries in each of the
transportation industries in 1998.
Industry Number of injuries
Railroad 4520
Intercity bus 5100
Subway 6850
Trucking 7144
Airline 9950
1. What are the variables that we are studying?
Looking at the table, we can see that we have two variables; type
of industry and number of injuries.
2. Categorize each variable as quantitative or
qualitative.
The type of industry, of course, is a qualitative variable, as the
values are names for transportation. At the same time, the
number of job-related injuries is quantitative, as the values are
numbers.
3. Categorize each quantitative variable as discrete or
continuous.
The number of job-related injuries is discrete.
4. Identify the level of measurement for each variable.
The type of industry is nominal, and the number of job-related
injuries is a ratio.
5. The railroad is shown as the safest transportation
industry. Does that mean railroads have fewer
accidents than the other industries? Explain.
This question makes you think about what the number means to
you. The railroads do show fewer job-related injuries; however,
there may be other things to consider. For example, railroads
employ fewer people than the other transportation industries in
the study.
6. From the information given, comment on the
relationship between the variables.
We can see that the railroads have the fewest job-related
injuries. In contrast, the airline industry has the most job-related
injuries (more than twice those of the railroad industry). The
numbers of job-related injuries in the subway and trucking
industries are fairly comparable.
Bottom line: always look at data and see what you can see
behind, how they are related, and how they compare to each
other.