LBYACST [Lecture Notes] (3)
LBYACST [Lecture Notes] (3)
Data sets
● The data collected in a particular study
Elements, Variables, and Observations
Elements
● Entities on which data are collected
● Name of companies
Variable
● Characteristics of interest for elements
Observation
● Set of measurements collected for particular elements
● Measurements collected on each variable for every elements in a study provide the data
N observations
● Observations with n elements
● A data set with n elements contains n observations
Business analytics begins with a data set (a simple collection of data or a data file) or commonly
with a database (a collection of data files that contain information on people, locations, and so
on)
Categorical data are labels or names used to identify an attribute of each elements
● Labels or names used to identify attribute of each elements
● Often referred to as qualitative data
● Use either nominal or ordinal scale of measurement
● Can be numeric or non numeric
● Appropriate statistical analyses …
Quantitative data
● Indicate how many or how much
○ Discrete - measuring how many (use whole numbers or integers)
○ Continuous - measuring how much (can measure infinite dimension)
● Always numeric
● Ordinary arithmetic operations are meaningful for computations
Scales of measurement
Types of data palagay pic omg hindi ko na ss surri may nagtext kasi
*** panooding q recording if may time pala ako mamaya
Cross-sectional data
● Data are collected at the same or approximately the same point in time
● Example luh oo ganyan yan shea bilis
Time-series data
● Data are collected over several time periods
● Example: Number of building permits issued in the last 36 months
Data acquisition considerations
● Time requirement
○ Searching for information can be time-consuming
○ Information may no longer be useful by the time it is available
● Cost of acquisition
○ Organizations often charge for information even when it is not their primary
business entity
● Data errors
○ Using any data that happens to be available or was acquired with little care can
lead to misleading information
2 branches of statistics
● Descriptive Statistics
○ Summarizes and organizes data to describe its main features.
○ Provides a snapshot of the data without making predictions or inferences.
○ Typical methods are: tabular, graphical, and numerical methods.
● Inferential statistics
○ Uses a sample of data to make generalizations, predictions, or inferences about
a larger population
○ Involves hypotheses testing and estimation
Statistical Inference
● Population
○ The set of all elements of interest in a particular study
● Sample
○ Subset of population
● Statistical inference
○ Process of using data obtained from a sample to make estimates and test
hypotheses about the characteristics of a population
● Census
○ Collecting data for a population
● Sample survey
○ Collecting data for a sample