WELCOME
ABOUT THE COURSE
Part 1
• Connecting to Data Sources
• Exploring and Analysing Data
• Sharing Insights
• Understanding Tableau Concepts
• Quizzes
• Practice Exam 2
Part 2
• Connect to and Transform Data
• Exploring and Analysing Data
• Create Content
• Publish and Manage Content
• Quizzes
• Practice Exam 2
ABOUT THE DS EXAM
• First exam in Tableau’s skill level hierarchy.
• Recommended 3+ months experience.
• 45 multiple Choice, multiple answer questions.
• 60-minute time limit.
• 750/1000 required to pass.
Exam Outline
Connect To and Prepare Data – 23%
Explore and Analyse Data – 37%
Sharing Insights – 25%
Understanding Tableau Concepts – 15%
ABOUT THE DA EXAM
• Second exam in Tableau’s skill level hierarchy.
• 40-45 multiple Choice, multiple answer questions.
• 8-10 hands on lab questions.
• 120-minute time limit.
• 750/1000 required to pass.
Exam Outline
Connect To and Transform Data – 24%
Explore and Analyse Data – 41%
Create Content – 26%
Publish and Manage Content – 9%
CONNECTING TO
D ATA S O U R C E S
• We need data!
• We need to understand the different data sources
you can connect to.
LIVE CONNECTIONS
VS EXTRACTS
Understanding the difference between live connections and
extracts is vital. Both live connections and extracts have
their advantages.
Live Connections:
• Real time data
• Tableau relies on database for all queries
Extracts:
• Snapshots of data saved in memory
• Tend to be much faster than live
connections
• Must refresh the extract to retrieve latest
data.
JOINS &
R E L AT I O N S H I P S
• A join combines two data sets together into one table.
• A relationship acts like a join however keeps the tables
separate.
• Relationships are known as the logical layer and joins are
known as the physical layer.
Join Disadvantages:
• Joins are static
• May result in losing data
• Can result in poor performance
UNIONS
• Unions are an extra method of combining two or more
tables in Tableau.
• Achieved by appending rows from one table to another.
• Can only be successful when the tables contain the same
columns.
C H A N G I N G D ATA
TYPES
M O U L E 1 S U M M A RY
• How to connect to a data source
• Live Connections Vs Extracts
• Multiple Connection Data Sources
• Joins & Relationships
• Unions
• Renaming fields
• Aliases
• Assigning Geographical Roles
• Changing data types
• Changing default properties
B A R C H A RT S
• Bar charts are one of the most popular and effective
charts that you can use.
• Bar charts allow us to easily compare a measure over
different categorical data values.
L I N E C H A RT S
• Line charts are used when we want to visualise a
measure over time to see how that measure changes
from point to point.
• To create a line chart, we need a measure and a date
field.
• We can create line charts with multiple measures as
seen in the second chart on the left.
S C AT T E R P L O T S
• Scatterplots are used when we want to show the
relationship between two things to understand if a
relationships exists.
• To create a scatterplot, we need two measures and a
dimension to split the scatterplot to include individual
elements.
MAPS
• Maps provide an effective and intuitive way to represent
geographical data.
• Everyone is familiar with maps which helps users
understand where things are located.
• Maps allow users to see geographical relationships that
might not be obvious when visualised in other ways.
COMBINED AXIS CHART
• Combined axis charts are effective as they allow us to
display two or more data series on a chart the same
axes.
• Combined axis charts are a great space saver whilst
providing comparison between different measures.
Instead of two separate graphs combined axis charts
help identify trends and correlations that might not be
obvious when looking at the data in isolation.
• Overall, combined axis charts allow us to present a more
comprehensive visualisation by integrating multiple
aspects in a single chart.
DUAL AXIS CHART
• Dual-axis charts, similar to combined axis charts, differ
in that they place two bars representing distinct
measures within the same row.
• Again, they are a good way of being able to make
comparisons between two measures.
• Once again, we benefit from saving space by placing two
measures within the same visualisation and it provides a
more interesting and comprehensive visualisation than
comparing the measures in separate graphs.
S TA C K E D B A R C H A RT
• Stacked bar charts are particularly effective when we
want to demonstrate parts within a whole. Each bar
represents the whole and each segment represents and
individual section in proportion to that whole.
• Stacked bar charts can make it easy to compare how the
same breakdown is either similar or different across
different categories.
• Stacked bar charts are another good example of a simple
and effective chart which should be utilized as much as
possible rather than going for something more complex
for no reason.
DENSITY MAPS
• Density maps, which you may also hear named heat maps are
particularly good when we want to demonstrate the distribution
or concentration of data points in a geographical location.
• The density maps visualises with intensity where there are larger
clusters of data points to make it obvious where the most activity
is taking place.
• An important feature of density maps isn’t just the areas of large
clusters but also outliers, we can identify outliers which can be
interesting to analyse.
BINS & HISTOGRAMS
• Bins are defined ranges or intervals that we define as developers
which group numerical values into the ranges or intervals.
• Bins are created before then being used in histograms which are
a visualisation type to show the distribution of data within these
created bins.
• The histogram displays the count of data points that fall into
each bin.
M O D U L E 2 S U M M A RY
• Bar chart
• Line chart
• Scatterplot
• Map
• Combined axis chart
• Dual axis chart
• Stacked bar
• Density map
• Groups
• Sets
• Hierarchies
• Filters
• Manual & computed sorts
• Reference lines
• Quick table calculations
• Bins and histograms
• Calculated fields
• Parameters
• Worksheet totals
M O D U L E 3 S U M M A RY
• Using colour from the marks card
• Configuring fonts
• Formatting marks as shapes
• Configuring viz animations
• Changing size of marks card
• Legends
• Adding worksheets to a dashboards
• Adding interacting elements to a dashboard
• Adding dashboard actions
• Creating device specific dashboards
• Creating a story
• Sharing a workbook
• Viewing and exporting underlying data
DIMENSIONS &
MEASURES
• A value inside a dimension is not dependant on a measure,
whereas the value of the measure is dependent on the
dimension.
• We can't aggregate dimensions however we can aggregate
measures.
• When you drag a dimension onto rows or columns in a view it
creates headers, whereas if you drag a measure into rows or
columns it creates an axis.
M O D U L E 4 S U M M A RY
• Dimensions and Measures
• Discrete and Continuous
• Default Aggregation
PA R T 2
D A T A A N A LY S T E X A M
D ATA Q U A L I T Y
Data Completeness
Data completeness refers to the extent to which a dataset includes
all the necessary and expected information without any missing or
null values. Incomplete data, where certain values are missing can
hinder accurate analysis.
Data Consistency
Data consistency refers to the uniformity and coherence of data
across a dataset or between different datasets. Inconsistent data
may exhibit variations, discrepancies, or contradictions that can
arise from errors, different data sources, or changing data formats.
Data Accuracy
Data accuracy refers to the correctness and precision of data in a
dataset. Accurate data is free from errors or mistakes, and it
represents the real-world information it is intended to describe.
DISCRETE & CONTINUOUS
Discrete
Discrete data consists of distinct, separate values. It often
represents categorical or qualitative information.
Continuous
Continuous data represents a range of values that can be measured
along a continuous scale. It is often used for quantitative or
numerical data.