Lecture Week 7- Plotting and Visualization
Lecture Week 7- Plotting and Visualization
Visualization
While many people use ‘graph’ and ‘chart’ interchangeably, they are
different visuals. Charts are tables, diagrams or pictures that organize
large amounts of data clearly and concisely(not just those based on x, y
coordinates) e.g. Pie chart, Bar chart, Gantt chart, and Flow chart . People
use charts to interpret data, make predictions, summarize and
communicate insights.
Graphs, however, focus on raw data and show patterns, trends,
correlations time-series data, and statistical relationships and
distributions. Graphs uses a coordinate system like x, y coordinates to
show numerical relationships. e.g. Line graph (chart), Scatter Plot, Area
graph (chart).
Line chart
Line chart (Run Chart in SPC) illustrates
how related data changes over a specific
period of time. One axis might display a
value, while the other axis shows the
timeline. Line charts are useful for
illustrating trends such as temperature
changes during certain dates. You should
use it when you graph a continuous data
set. They're also helpful for measuring
how different groups relate to each other.
You can use it to show many different
categories of data. E.g., a business might
use this graph to compare sales rates for
different products or services over time.
Used usually to visualize data in
forecasting analysis (Time series analysis)
Area chart:
An area chart is basically a line chart, but the
space between the x-axis and the line is filled
with a color or pattern. Area charts show a
change in one or more quantities over a certain
period of time. They often help when displaying
trends and patterns. Similar to a line chart, area
charts use dots connected by a line. However,
an area chart involves coloring between the line
and the horizontal axis. You can use several
lines and colors between each one to show how
multiple quantities add up to a whole. For
example, a retailer might use this method to
display the profits of different types of a
product over the same timeframe. They work
best for big differences between data sets and
help visualize big trends. The area betweentwo lines represents
the contribution of the second variable on top
of the first.
How many variables
in this graph?
Bar chart
Column Chart category Numeric (counts, percentages, Sales,.. but column charts are best for negative data. Use to show: Trends, Compare,
Distributions.
Stacked Bar Chart category Numeric (counts, percentages, Sales,.. compare many different items and show the composition of each item you’re
comparing. Use to show: Compare, Compositions.
Histogram range values illustrate statistics: Use to show: Distributions.
Dual-Axis Chart Shared x-axis Two datasets one numeric and the other categorical to visualize a correlation or the lack thereof between these two data sets. Use
to show: Trends.
Scatter plot numeric numeric how much one variable affects the other. Use to show: Compare,
Distributions, Relationships.
Pictograph category Numeric (counts, percentages, Sales,..) display data in a highly visual presentation
Flowchart NA NA help organize the steps
Pie chart NA NA presents the different parts of a whole (Categorical/Numerical). Use to show:
Compare, Compositions.
Gantt chart Timeline Tasks length of the bar depends on the start and end date of the task (Duration of
the task)
Waterfall chart category Numeric (counts, percentages, Sales,..) reflect variance over time. Use to show: Compositions.
Bubble Chart numeric Numeric. A third numerical variable represented by size) Show relation between three numerical variables. Use to show: Relationships.