2. Types of Data_students
2. Types of Data_students
• Dataset
• Qualitative data
1. Nominal data Dataset
2. Ordinal data
• Quantitative data Qualitativ Quantitati
e data ve data
1. Interval data
2. Ratio data Nominal Ordinal Interval
Ratio data
data data data
Qualitative data
• Qualitative data provides information about the quality of
an object or information which cannot be measured.
• For example, if we consider the quality of performance of
students in terms of ‘Good’, ‘Average’, and ‘Poor’, it falls
under the category of qualitative data.
• Also, name or roll number of students are information that
cannot be measured using some scale of measurement.
So, they would fall under qualitative data. Qualitative data
is also called categorical data.
• E.g. Human behavior, intentions, attitudes, experience,
etc.,
Qualitative Variables: Sometimes referred to as “categorical”
variables, these are variables that take on names or labels and
can fit into categories. Examples include:
• Eye color (e.g. “blue”, “green”, “brown”)
• Gender (e.g. “male”, “female”)
• Breed of dog (e.g. “lab”, “bulldog”, “poodle”)
• Level of education (e.g. “high school”, “Associate’s degree”,
“Bachelor’s degree”)
• Marital status (e.g. “married”, “single”, “divorced”)
Dataset
Qualitativ Quantitati
e data ve data
1. Nominal data
2. Ordinal data
• Nominal data is one which has no numeric value, but
a named value.
• It is used for assigning named values to attributes.
• Nominal values cannot be quantified.
• Examples of nominal data are:
1. Blood group: A, B, O, AB, etc.
2. Nationality: Indian, American, British, etc.
3. Gender: Male, Female, Other
Nominal data
• Nominal data is one which has no numeric value, but a
named value.
• It is used for assigning named values to attributes.
• Nominal values cannot be quantified.
• Nominal data is a type of categorical data that
represents labels or names without any inherent order
or ranking. It is used to classify data into distinct
categories that are mutually exclusive and exhaustive.
Characteristics of Nominal Data
•While ranks can be compared (greater or lesser), arithmetic operations like addition or
subtraction are not meaningful.
•Example: "Very Happy" - "Neutral" doesn’t make sense numerically.
Characteristic of ordinal data
•Qualitative with Order:
•Although ordinal data can be numeric (e.g., 1 = Poor, 2 = Fair, 3 = Good), the numbers are
labels, not actual values.
•Analysis:
•Measures like median and percentiles are appropriate.
•Cannot calculate meaningful averages (mean).
•Visualization:
•Best visualized with bar charts, stacked bar charts, or histograms.
• Examples of Ordinal Data
1.Rankings:
1. Movie ratings: Excellent, Good, Average, Poor.
2. Competition placement: 1st, 2nd, 3rd.
2.Scales:
1. Likert scale in surveys: Strongly Agree, Agree, Neutral, Disagree,
Strongly Disagree.
3.Grades:
1. Academic performance: A, B, C, D, F.
4.Socioeconomic Levels:
1. Low income, middle income, high income.
• Difference Between Nominal and Other Data
Types
1.Nominal vs. Ordinal:
1.Nominal: Categories have no order (e.g., Colors: Red, Green,
Blue).
2.Ordinal: Categories have a logical order (e.g., Size: Small,
Medium, Large).
• Difference Between Ordinal and Other Data Types
1.Ordinal vs. Nominal:
1.Ordinal: Has order (e.g., Small, Medium, Large).
2.Nominal: No order (e.g., Red, Blue, Green).
2.Ordinal vs. Interval/Ratio:
1.Ordinal: No equal intervals or true zero point (e.g.,
Happiness levels).
2.Interval/Ratio: Numeric, ordered, with equal intervals (e.g.,
Temperature, Age).
Operations on Nominal data
1. Counting (Frequency)
•Description: You can count the occurrences of each rank or
category.
•Example: For customer satisfaction ratings (Very Unsatisfied,
Unsatisfied, Neutral, Satisfied, Very Satisfied):
•Very Satisfied: 10
•Satisfied: 15
•Neutral: 5
2. Median
•Description: The middle value or rank can be identified when the data is ordered.
•Example: For rankings: Very Unsatisfied, Unsatisfied, Neutral, Satisfied, Very
Satisfied:
•If the dataset has 9 responses in this order:
Very Satisfied, Satisfied, Neutral, Neutral, Neutral, Satisfied, Very Satisfied,
Unsatisfied, Neutral
•After ordering: Very Unsatisfied, Unsatisfied, Neutral, Neutral, Neutral,
Satisfied, Satisfied, Very Satisfied, Very Satisfied
•The median is Neutral (the 5th value in the ordered list).
3 Percentiles and Quartiles
•Description: You can divide the data into percentiles or quartiles based
on rank.
•Example: For 100 students ranked on performance (Poor, Average,
Good, Excellent):
•The top 25% (4th quartile) might fall in the Excellent category.
4. Mode:
Description: The most frequently occurring category can be identified.
Example: In a survey with responses like Neutral, Neutral, Satisfied
5. Ordering:
Description: Sorting ordinal data in ascending or descending order is
meaningful.
Example: Sorting rankings of hotels by star ratings (1-star, 2-star, 3-star,
4-star, 5-star) in descending order., Satisfied, Neutral, the mode is
Neutral.
Quantitative data
• Quantitative data relates to information about the
quantity of an object – hence it can be measured. For
example, if we consider the attribute ‘marks’, it can be
measured using a scale of measurement.
• Quantitative data is data that represents measurable
quantities or numerical values. Quantitative data is also
termed as numeric data. There are two types of
quantitative data:
1. Interval data
2. Ratio data
Characteristics of Quantitative data
1.Numerical Nature:
1. Quantitative data consists of numbers or quantities.
2. Example: Height (170 cm), weight (65 kg), test scores (95%).
2.Measurable:
1. Represents measurable attributes like length, time, temperature, or cost.
3.Arithmetic Operations:
1. You can perform mathematical operations like addition, subtraction, and averaging.
4.Two Subtypes:
1. Discrete Data: Finite or countable values (e.g., number of students in a class).
2. Continuous Data: Infinite or uncountable values within a range (e.g., time taken to
complete a task).
5.Objective:
1. Based on concrete measurements, not subjective interpretations.
6.Visualization:
1. Represented using histograms, scatter plots, line graphs, and box plots.
Quantitative
Data
Dataset
Qualitativ Quantitati
e data ve data
Arithmetic Operations:
•All mathematical operations, including addition, subtraction, multiplication, and division, are valid.
•Example: A person weighing 60 kg is twice as heavy as someone weighing 30 kg.
•Equal Intervals:
•The intervals between values are consistent and meaningful.
•Example: The difference between 20°C and 30°C is the same as between 50°C and 60°C when using Kelvin.
Ratios Are Meaningful:
•Because of the true zero, ratios can be interpreted.
•Example: A temperature of 40 Kelvin is twice as hot as 20 Kelvin.
Continuous or Discrete:
•Can take any value within a range (continuous) or specific countable values (discrete).
•Continuous Example: Height (e.g., 170.5 cm).
•Discrete Example: Number of students in a class (e.g., 20 students).
Visualization:
•Best represented using histograms, line graphs, scatter plots, or box plots.
Examples of Ratio Data
1.Physical Measurements:
1.Height, weight, distance, speed, temperature in Kelvin.
2.Financial Data:
1.Income, revenue, expenses.
3.Demographic Data:
1.Age, number of children, years of education.
Overview of operations for ratio
data:
• Percentiles and quartiles can be calculated on ratio
data because ratio data is quantitative, continuous (or
sometimes discrete), and allows for meaningful
ordering of values.
• Apart from the approach detailed above, attributes can also be
categorized into types based on a number of values that can be
assigned.
• The attributes can be either discrete or continuous based on this
factor.
• Discrete attributes can assume a finite or countably infinite
number of values.
• Nominal attributes such as roll number, street number, pin code,
etc. can have a finite number of values whereas numeric
attributes such as count, rank of students, etc. can have
countably infinite values.
• A special type of discrete attribute which can assume two values
only is
called binary attribute.
• Examples of binary attribute include: male/ female,
positive/negative, yes/no, etc.
• Continuous attributes can assume any possible value which is a
Discrete Data
Continuous data can take any value within a given range, including fractions and
decimals. It represents measurements rather than counts.
Key Characteristics:
• Measurable: Values can be measured, not just counted.
• Infinite Possibilities: Can include any value within a range.
• Fractions and Decimals Allowed: Examples include 5.5, 10.75, etc.
• Examples:
• Height of a person (e.g., 170.5 cm, 165.2 cm).
• Weight of an object (e.g., 65.3 kg, 72.8 kg).
• Time taken to complete a task (e.g., 12.5 seconds, 14.8 seconds).
• Common Visualizations:
Histograms, line graphs, scatter plots, box plots.
Common Applications
1.Discrete Data:
1.Business: Counting products sold, number of employees.
2.Sports: Counting goals, fouls, or wins.
2.Continuous Data:
1.Healthcare: Measuring patient blood pressure, weight, or
height.
2.Physics: Recording speed, temperature, or distance.
Note: