0% found this document useful (0 votes)
9 views

Lesson3 - Data Visualization and Pivot Table

The document discusses the importance of data visualization, highlighting its role in understanding data, spotting errors, and aiding decision-making. It covers various visualization methods such as tables, charts, and dashboards, along with principles for effective design and the concept of data-ink ratio. Additionally, it introduces tools like PivotTables and emphasizes the use of key performance indicators (KPIs) for measuring success in business intelligence.

Uploaded by

natehilado
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
9 views

Lesson3 - Data Visualization and Pivot Table

The document discusses the importance of data visualization, highlighting its role in understanding data, spotting errors, and aiding decision-making. It covers various visualization methods such as tables, charts, and dashboards, along with principles for effective design and the concept of data-ink ratio. Additionally, it introduces tools like PivotTables and emphasizes the use of key performance indicators (KPIs) for measuring success in business intelligence.

Uploaded by

natehilado
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 66

Data Visualization and

Pivot Table
CONTENTS

3.1.Why do we need visualization?


3.2 Data-Ink Ratio
3.3 Tables
3.4 Charts
3.5 BI and Data Visualizations
Why do we need Visualization?

▹ Help us understand and manage important data


streams that are flowing around us
▹ Move easily from "the big picture" to the minute details
at will
▹ Allow us to quickly spot errors in the data
▹ Enable us to perceive things we were not considering or
expecting and help us to better deal with the
unexpected
▹ Have deep simplicity
▹ Have some level of interactivity and qualities of good
collaborators
Why do we need Visualization? (cont’d)

▹ Help a spectrum of people make better real-


life, day-to-day decisions
▹ Help viewers gain new insights and enjoy
exploring data
▹ Have some flexibility and be able to adapt
and adjust to the changing needs and
contexts of the user
Data-Ink Ratio

▹ Data-ink ratio: Measures the proportion of


what Tufte terms “data-ink” to the total
amount of ink used in a table or chart.
▸ Edward R. Tufte first described the data-ink ratio.
▸ Helpful for creating effective tables and charts for data visualization.
■ Data-ink - Ink used in a table or chart that is necessary to convey the
meaning of the data to the audience.
■ Non-data-ink - Ink used in a table or chart that serves no useful purpose in
conveying the data to the audience.
Data-Ink Ratio (cont’d)
Tables

• Tables should be used when:


1. The reader needs to refer to specific
numerical values.
2. The reader needs to make precise
comparisons between different values
and not just relative comparisons.
3. The values being displayed have
different units or very different
magnitudes.
Tables (cont’d)
Table 3.3 - Table showing Exact Values for Costs and Revenues
by Month for Gossamer Industries

Figure 3.5 - Line Chart of Monthly Costs and Revenues at Gossamer


Industries
Tables (cont’d)

Figure 3.6 - Combined Line Chart and Table for Monthly Costs
and Revenues at Gossamer Industries
Tables (cont’d)

Table 3.4 - Table displaying Headcount, Costs,


and Revenues at Gossamer Industries
Tables (cont’d)

• Table Design Principles:


▸ Avoid using vertical lines in a table unless they are necessary for clarity.
▸ Horizontal lines are generally necessary only for separating column titles
from data values or when indicating that a calculation has taken place.
Tables (cont’d)

Figure 3.7 - Comparing different Table Designs


Tables (cont’d)

Table 3.5 - Larger Table Showing Revenues by


Location for 12 Months of Data
Charts

• Charts (or graphs): Visual methods of displaying


data.
• Scatter chart: Graphical presentation of the
relationship between two quantitative variables.
▸ Trendline – A line that provides an approximation of the
relationship between the variables.
• Line chart: A line connects the points in the chart.
▸ Useful for time series data collected over a period of time.
(minutes, hours, days, years, etc.)
Charts (cont’d)

Table 3.8 - Sample Data for the San Francisco


Electronics Store
Charts (cont’d)

Figure 3.14 - Scatter Chart for the San Francisco


Electronics Store
Charts (cont’d)

Table 3.9 - Monthly Sales Data of Air Compressors at


Kirkland Industries
Charts (cont’d)

Figure 3.15 - Scatter Chart and Line Chart for Monthly Sales
Data at Kirkland Industries
Charts (cont’d)

Figure 3.16 - Line Chart of Regional Sales Data at Kirkland


Industries
Charts (cont’d)

• Sparkline - Special type of line chart


▸ Minimalist type of line chart that can be placed directly into a cell in Excel.
▸ Contain no axes; they display only the line for the data.
▸ Take up very little space, and they can be effectively used to provide
information on overall trends for time series data.
Charts (cont’d)

Figure 3.17 - Sparklines for the Regional Sales Data at


Kirkland Industries
Charts (cont’d)

• Bar Charts: Use horizontal bars to display


the magnitude of the quantitative variable.
• Column Charts: Use vertical bars to display
the magnitude of the quantitative variable.
▸ Bar and column charts are very helpful in making comparisons between
categorical variables.
Charts (cont’d)

Figure 3.18 - Bar Charts for Accounts Managed Data

Gentry manages the greatest number of accounts and Williams the fewest.
Charts (cont’d)

Figure 3.19 - Sorted Bar Chart for Accounts Managed Data


Charts (cont’d)

Figure 3.20 - Bar Chart with Data Labels for Accounts Managed
Data
Charts (cont’d)

• Pie charts: Common form of chart used to


compare categorical data.
• Bubble chart: Graphical means of
visualizing three variables in a two-
dimensional graph.
▸ Sometimes a preferred alternative to a 3-D graph.

• Heat map: A two-dimensional graphical


representation of data that uses different
shades of color to indicate magnitude.
Charts (cont’d)

Figure 3.21 - Pie Chart of Accounts Managed


Charts (cont’d)

Table 3.11 - Sample Data on Billionaires


per Country
Charts (cont’d)

Figure 3.23 - Bubble Chart Comparing Billionaires by Country


with Data Labels added
Charts (cont’d)

Figure 3.24 - Heat Map and Sparklines for Same-Store


Sales Data
Charts (cont’d)

• Additional charts for multiple variables


▸ Stacked column chart: Allows the reader to compare the relative values of
quantitative variables for the same category in a bar chart.
▸ Clustered column (or bar) chart: An alternative chart to stacked column
chart for comparing quantitative variables.
▸ Scatter chart matrix: Useful chart for displaying multiple variables.
Charts (cont’d)

Figure 3.25 - Stacked Column Chart for Regional Sales Data for
Kirkland Industries
Charts (cont’d)

Figure 3.26 - Comparing Stacked, Clustered and Multiple


Column Charts for the Regional Sales Data for Kirkland
Industries
Charts (cont’d)

Table 3.12 - Data for New York city Subboroughs


Tables (cont’d)

• Crosstabulation: A useful type of table for


describing data of two variables.
• PivotTable: A crosstabulation in Microsoft
Excel.
Tables (cont’d)

Table 3.6 - Quality Rating and Meal Price for


300 Los Angeles Restaurants
Tables (cont’d)

Table 3.7 - Crosstabulation of Quality Rating and Meal Price for 300 Los
Angeles Restaurants

• The greatest number of restaurants in the sample (64) have a very good rating and
a meal price in the $20–29 range.
• Only two restaurants have an excellent rating and a meal price in the $10–19
range.
• The right and bottom margins of the crosstabulation give the frequency of quality
rating and meal price separately.
Tables (cont’d)

Figure 3.8 - Excel Worksheet Containing Restaurant Data


Charts (cont’d)

• PivotChart: To summarize and analyze data


with both a crosstabulation and charting,
Excel pairs PivotCharts with PivotTables.
Charts (cont’d)

Figure 3.28 - PivotTable and PivotChart for the Restaurant Data


Big Data: A Business Example

▹ How BI Can Answer Tough Customer


Questions

FIGURE 3.7
Measuring Success

▹ Metrics
▸ Measurements that evaluate results to
determine whether a project is meeting its
goals
▸ Common Types:
▸ KPIs – Key Performance Indicators
▸ Critical success factors (CSFs)
Critical Success Factors

▹ These are crucial steps companies perform to


achieve their goals and objectives and implement
their strategies. (CSFs for a firm that want to be
best).
▸ Create high-quality products
▸ Retain competitive advantages
▸ Reduce product costs
▸ Increase customer satisfaction
▸ Hire and retain the best professionals
Key Performance Indicators (KPIs)

The quantifiable metrics a company uses to evaluate


progress toward critical success factors
▸ Turnover rates of employees

▸ Number of product returns

▸ Number of new customers

▸ Average customer spending


Measuring Success

▹ Efficiency Metrics ▹ Effectiveness Metrics

▹ Throughput ▹ Usability
▹ Transaction Speed ▹ Customer
▹ System Availability Satisfaction
▹ Information ▹ Conversion Rates
Accuracy ▹ Financial Goal
▹ Response Time Achievement
Visual Business Intelligence

▹ Infographics Emerson’s Food


▸ Charts and pictures representing data sets Waste
▸ Easily understood comparisons Infographic

▹ Data Visualization and Visualization Tools


▸ Sophisticated mathematical models
▸ Ranging from time series graphs to webs of networks and
clusters formed from thousands of points
▸ Identifying trends, emerging markets etc.
▹ Intelligence Dashboard
▸ Key decision data electronically displayed in multiple charts,
graphs and other visualization tools
▸ Dynamically updated in real time
Digital Dashboards

Integrates information from multiple components and presents


it in a unified display

FIGURE 2.12
Donuts

Instead of a pie chart, a donut


may be used if there are too
many categories.
Stacked Continuous

Stacked Continuous may be used for dimensions with different


proportions to be compared to one another over time.
Parallel Coordinates Plot

• Parallel coordinates plot: Chart for examining


data with more than two variables.
▸ Includes a different vertical axis for each variable.
▸ Each observation is represented by drawing a line on the parallel
coordinates plot connecting each vertical axis.
▸ The height of the line on each vertical axis represents the value taken by
that observation for the variable corresponding to the vertical axis.

• Treemap: Useful for visualizing hierarchical data


along multiple dimensions.
Parallel Coordinates Plot

Figure 3.29 - Parallel Coordinates Plot for


Baseball Data
Treemap

Figure 3.30 - SmartMoney’s Map of the Market as An Example


of A Treemap
GIS Charts

• Geographic Information systems charts


▸ Geographic Information Systems (GIS): A system that merges maps and
statistics to present data collected over different geographies.
■ Helps in interpreting data and observing patterns.
GIS Charts (cont’d)

Figure 3.31 - GIS Chart for Cincinnati Zoo Member Data


Regions by Color

▹ Choropleth maps are the most common way to map


regional data. Based on some metric, regions are colored
following a defined color scale.
Space and Time Animation

▹ One of the best ways to visualize changes over space and


time. It shows the changes as they happen on a single
interactive map.
CONTENTS

4.1 Example of Advance Data Visualizations


4.2 Data Dashboards
4.3 Tableau Public as Tool for Data Visualization
Data Dashboards

• Data dashboard: Data visualization tool that


illustrates multiple metrics and automatically
updates these metrics as new data become
available.
• Key performance indicators (KPIs) in
dashboards:
▸ Automobile dashboard - Current speed, Fuel level, and oil pressure
▸ Business dashboard - Financial position, inventory on hand, customer service
metrics
Data Dashboards (cont’d)

• Principles of effective data dashboards


▸ Should provide timely summary information on KPIs that are important to
the user.
▸ Should present all KPIs as a single screen that a user can quickly scan to
understand the business’s current state of operations.
Data Dashboards (cont’d)

• Principles of effective data dashboards


(cont’d.)
▸ The KPIs displayed in the data dashboard should convey meaning to its user
and be related to the decisions the user makes.
▸ A data dashboard should call attention to unusual measures that may
require attention.
■ Color should be used to call attention to specific values to differentiate
categorical variables, but the use of color should be restrained.
Data Dashboards (cont’d)

Figure 3.32 - Data Dashboard for the Grogan Oil Information


Technology Call Center
Layers of Info in Dashboard

Graphical Data Summarized Data Transactional Data

• Provides a graphical • Consists of dimensional • Lets users view


view of performance data that lets users detailed data, such as
metrics, usually in the navigate the data by invoices, shipments, or
form of charts and subject transactions, stored in
alerts. data warehouses or
operational systems

62
Layers of Info in Dashboard (cont’d)

Graphical Data • Used by Executives to Monitor

• Used by Analysts to
Dimensional data
Analyze

• Used by Workers
Transactional Data
to extract Detail

63
Types of Dashboards

Operational
Strategic Dashboards Tactical Dashboards
Dashboards
• Communicate • Used to review • Enables front-line
strategy and and benchmark workers to
review the performance manage and
performance at of the company, control
monthly strategy while managers operational
or operational use them to processes using
review meetings monitor and detailed data
• Emphasis on optimize • Emphasis on
Management processes Monitoring
• Emphasis on
Analysis

64
SUMMARY

3.1.Why do we need visualization?


3.2 Data-Ink Ratio
3.3 Tables
3.4 Charts
3.5 BI and Data Visualizations
thanks!
ANY QUESTIONS?

You can reach me at:


ariane_________torres@un
o-r.edu.ph

You might also like