0% found this document useful (0 votes)
33 views

IDS Unit 5 Visualization

Uploaded by

Avyuktha Raju
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
33 views

IDS Unit 5 Visualization

Uploaded by

Avyuktha Raju
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 24

Data Visualization

• Why data visualization?


• Data visualization aims to communicate data clearly
and effectively through graphical representation.
• Data visualization has been used extensively in
many applications for Eg. At work for reporting
managing business operations and tracking progress
of tasks.
• Provide qualitative overview of large data sets
• Search for patterns, trends, structure, irregularities,
relationships among data
• advantage of visualization techniques is to discover
data relationships that are not easily observable
by looking at the raw data. 1
Categorization of visualization methods:

– Pixel-oriented visualization techniques

– Geometric projection visualization techniques

– Icon-based visualization techniques

– Hierarchical visualization techniques

– Visualizing complex data and relations


Pixel-oriented visualization techniques

• A simple way to visualize the value of a dimension is to


use a pixel.
• where the color of the pixel reflects the dimension’s
value.
• For a data set of m dimensions pixel-oriented techniques
create m windows on the screen, one for each
dimension.
• The m dimension values of a record are mapped to m
pixels at the corresponding position in the windows.
• Inside a window, the data values are arranged in some
global order shared by all windows.
• Eg: All Electronics maintains a customer information table,
which consists of 4 dimensions: income, credit_limit,
transaction_volume and age. We analyze the correlation
between income and other attributes by visualization.
• We sort all customers income in ascending order and use this
order to layout the customer data in the 4 visualization
windows as shown in fig.
• The pixel colors are chosen so that the smaller the value, the
lighter the shading.
• Using pixel -based visualization we can easily observe that
credit limit increases as income increases customer whose
income is in the middle range are more likely to purchase more
from All Electronics, these is no clear correlation between
income and age.
5
2.Geometric Projection visualization techniques
Geometric Projection Visualization Techniques
• Visualization of geometric transformations and projections of
the data
• Methods
– Scatterplot and scatterplot matrices
– Landscapes
– Parallel coordinates
– Projection pursuit technique: Help users find meaningful
projections of multidimensional data
– Prosection views
– Hyperslice

7
Scatterplot Matrices
Used by ermission of M. Ward, Worcester Polytechnic Institute

Matrix of scatterplots (x-y-diagrams) of the k-dim. data [total of (k2/2-k) scatterplots]


8
Interpreting Scatter plat matrix:
Leave the diagonal squares. Now, for example, in the below scatter plot. Yield will
be on Y-axis for it left and right sides and it will be on X-axis for top and bottom.

Y-axis: Yield

X-axis: Rainfall

Y-axis: Rainfall

X-axis: Yield
Landscapes
Used by permission of B. Wright, Visible Decisions Inc.

news articles
visualized as
a landscape

• Visualization of the data as perspective landscape


• The data needs to be transformed into a (possibly artificial) 2D spatial
representation which preserves the characteristics of the data
10
Parallel Coordinates
• n equil distant axes which are parallel to one of the screen axes and correspond
to the attributes.
• The axes are scaled to the [minimum, maximum]: range of the corresponding
attribute
• Every data item corresponds to a polygonal line which intersects each of the
axes at the point which corresponds to the value for the attribute

• • •

Attr. 1 Attr. 2 Attr. 3 Attr. k

11
Note
Parallel Coordinates are useful to represent higher dimension of data. X, Y, Z, are
dimensions.

They are represented as vertical lines and the points are noted on those line and mapped.
Example

13
Icon-Based Visualization Techniques
• Visualization of the data values as features of icons

• Typical visualization methods


– Chernoff Faces
– Stick Figures

14
Stick Figure
A census data
figure showing age,
income, gender,
education, etc.
used by permission of G. Grinstein, University of Massachusettes at Lowell

A 5-piece stick figure


(1 body and 4 limbs w.
different angle/length)

16
Two attributes mapped to axes, remaining attributes mapped to angle or length of limbs”. Look at texture pattern
Hierarchical Visualization Techniques

• Visualization of the data using a hierarchical


partitioning into subspaces
• Methods
– Dimensional Stacking
– Worlds-within-Worlds
– Tree-Map
– Cone Trees
– InfoCube

17
Dimensional Stacking

attribute 4
attribute 2

attribute 3

attribute 1

• Partitioning of the n-dimensional attribute space in 2-D


subspaces, which are ‘stacked’ into each other
• Partitioning of the attribute value ranges into classes. The
important attributes should be used on the outer levels.
• Adequate for data with ordinal attributes of low cardinality
• But, difficult to display more than nine dimensions
• Important to map dimensions appropriately
18
Worlds-within-Worlds
• Assign the function and two most important parameters to innermost world
• Fix all other parameters at constant values - draw other (1 or 2 or 3
dimensional worlds choosing these as the axes)
• Software that uses this paradigm

N–vision: Dynamic
interaction through data
glove and stereo displays,
including rotation, scaling
(inner) and translation
(inner/outer)
Auto Visual: Static
interaction by means of
queries

19
Tree-Map
• Screen-filling method which uses a hierarchical partitioning of
the screen into regions depending on the attribute values
• The x- and y-dimension of the screen are partitioned alternately
according to the attribute values (classes)

MSR Netscan Image

20
Ack.: http://www.cs.umd.edu/hcil/treemap-history/all102001.jpg
InfoCube
• A 3-D visualization technique where hierarchical
information is displayed as nested semi-
transparent cubes
• The outermost cubes correspond to the top level
data, while the subnodes or the lower level data
are represented as smaller cubes inside the
outermost cubes, and so on

21
Three-D Cone Trees
• 3D cone tree visualization technique works well
for up to a thousand nodes or so
• First build a 2D circle tree that arranges its
nodes in concentric circles centered on the
root node
• Cannot avoid overlaps when projected to 2D
• G. Robertson, J. Mackinlay, S. Card. “Cone
Trees: Animated 3D Visualizations of
Hierarchical Information”, ACM SIGCHI'91
• Graph from Nadeau Software Consulting
website: Visualize a social network data set
that models the way an infection spreads from
one person to the next

Ack.: http://nadeausoftware.com/articles/visualization
22
Visualizing Complex Data and Relations
• Visualizing non-numerical data: text and social networks
• Tag cloud: visualizing user-generated tags

The importance of tag is


represented by font size/color
Besides text data, there are also
methods to visualize relationships,
such as visualizing social networks.

Newsmap: Google News Stories in 2005


Dimensional Stacking
Used by permission of M. Ward, Worcester Polytechnic Institute

Visualization of oil mining data with longitude and latitude mapped to the
outer x-, y-axes and ore grade and depth mapped to the inner x-, y-axes
24

You might also like