Intro To Business Analytics
Intro To Business Analytics
Business Analytics
Chapter 1: Introduction to Business Analytics
Instructor: Hamed Qahri-Saremi, PhD
What is Business Analytics
• Business analytics is the data-oriented approach to business problem solving and decision making, at
the intersection of:
– Statistics
– Computer Science
– Information Systems
• Source of decision
insights.
2023
Why Business Analytics:
Data Perspective
https://en.wikipedia.org/wiki/Metric_prefix
https://www.statista.com/statistics/871513/worldwide-data-created/
Why Business Analytics:
Technology Perspective – Evolution of Computing Technology
• Increasingly Expanding Software Solutions Market:
https://www.wsj.com/articles/huangs-law-is-the-new-moores-law-and-e
xplains-why-nvidia-wants-arm-11600488001
Why Business Analytics:
Decision Making Perspective
– Predictive analytics:
• What will happen in the future?
– Predicting unknowns based on knowns.
– Prescriptive analytics:
• What should we do?
– Scenario analysis.
– Turning data-driven recommendations into action also requires thoughtful consideration and organizational
commitment beyond developing descriptive and predictive analytical models.
Descriptive Analytics
• Descriptive Analytics: Describing what we know.
– Gather (data)
– Organize
– Tabulate
– Visualize
– Summarize
• Descriptive information can be presented in several formats (e.g., written reports, tables,
graphs, maps), such as:
– A firm’s marketing expenses and sales
– Financial reports
– Crime rates across regions and time
– COVID Cases: https://coronavirus.jhu.edu/us-map
• Examples
– Identifying customers who are most likely to respond to specific marketing campaigns
– Transactions that are likely to be fraudulent
– Incidence of crime at certain regions and times
– Products that customers are more likely to purchase
Prescriptive Analytics
• Prescriptive Analytics: what scenario/decision/action is the best?
– Optimization and simulation algorithms to provide advice
– Explore several possible actions to find an optimal solution
• Scenario analysis
– Suggest course of action
• Examples
– Scheduling employees’ works hours
– Business Process Simulation and Improvement
– Select a mix of products to manufacture
– Choose an investment portfolio
Business Analytics
Let’s Try…
Example Descriptive? Predictive? Prescriptive?
Which candidate will win the election? X
What price for a product will maximize profit? X
How much money do I need to save each year to have
enough money for retirement? X
How many products were sold last year? X
What’s the best route for the delivery person to drop off
packages to minimize the time needed to deliver all the X
packages?
How will marketing affect the daily sales of a product?
X
What was the average purchase price for new customers last
year? X
Other real-
world
examples?
Types of Data
• An important first step for making decisions is to find the right data and prepare it for
analysis.
– Compilation of facts, figures, or other content
– Numerical and non-numerical
– All types and formats are generated from multiple sources
– Often, we have a large amount of data
– Even small data can give insights
Types of Data
• It is not feasible to collect data that comprise the whole population of all elements of
interest.
– Too expensive
– It is impossible
– Traditional statistical techniques use sample information to draw conclusions about the population.
• Unstructured data
– Do not conform to a pre-defined, row-column format
– Textual
– Multimedia content
– Do not conform to traditional relational database structures
Categorical Numerical
– Differences between categories are meaningless because the actual numbers used may be arbitrary.
• There is no objective way to interpret the difference between instructor quality.
Scales of Measurement
• The Interval Scale
– Data may be categorized and ranked with respect to some characteristic or trait.
– Differences between interval values are meaningful.
• Thus, the arithmetic operations of addition and subtraction are meaningful.
– No “absolute 0” or starting point defined.
• Meaningful ratios may not be obtained (What does mean?)
• For example, consider the Fahrenheit scale of temperature.
– This scale is interval because the data are ranked and differences (+ or −) may be obtained.
– But there is no “absolute 0” (What does 0°F mean?)
– There is one exception:
» The temperature scale in Kelvin is a ratio scale variable because its zero
value is absolute zero (i.e., nothing can be measured at a lower temperature
than 0 degrees Kelvin. No energy from molecular motion (that is, heat
energy) is available for transfer to other systems).
Scales of Measurement
• The Ratio Scale
– The strongest level of measurement.
– Ratio data may be categorized and ranked with respect to some characteristic or trait.
– Differences between interval values are meaningful.
– There is an “absolute 0” or defined starting point. “0” does mean “the absence of …” Thus, meaningful ratios
may be obtained.
– The following variables are measured on a ratio scale:
• General Examples: Weight, Time, and Distance.
• Business Examples: Sales, Profits, and Inventory Levels.
Variables & Scales of Measurement
Example 1.3: The owner of a ski resort gathers data from twenty tweens.
– Music: On your car drive to the resort, which music streaming service was playing?
– Food quality: Q2: On a scale of 1 to 4, rate the quality of the food at the resort (where 1 is poor, 2 is
fair, 3 is good, and 4 is excellent.)
– Closing time: Presently, the main dining area closes at 3:00 pm. What time do you think it should
close?
– Own money spent: How much of your own money did you spend at the lodge today?
Data File Formats
• There are standard file formats.
– Fixed-width format: each column starts and ends in the same place in every row (.txt file).
– Delimited format: a delimiter separates fields, typically a comma (.csv file), or a tab (.tsv file).
– Excel format.
– Extensible Markup Language (XML): structured data, each piece enclosed in a pair of tags, gives
information on what the data are.
– HyperText Markup Language (HTML): structured data with tags, gives information on how to
display the data.