0% found this document useful (0 votes)
46 views

Unit-5 BDA - Data Visualization

Uploaded by

aryanbond1009
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
46 views

Unit-5 BDA - Data Visualization

Uploaded by

aryanbond1009
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 19

Big Data Analytics (702DB0C029)

UNIT-5
DATA VISUALIZATION IN BIG DATA

05/12/24 BIG DATA ANALYTICS 1


What is Data Visualization
•Data visualization is act of taking information (data) and placing it into a visual
context, such as map or graph
•The main goal is to make it easier to identify patterns, trends, insights and
outliers in large data sets.
•The term is often used interchangeably with others, including information
graphics, information visualization and statistical graphics.
•It gives an overview of the big data quickly and easily.

05/12/24 BIG DATA ANALYTICS 2


Need of Data visualization
•Human beings naturally find patterns in pictures than in rows of data easier
•Data visualization is one of the steps of the data science process, which states
that after data has been collected, processed and modeled, it must be visualized
for conclusions to be made.
•Even people who are good with SQL queries prefer a visual format to make
observations instead of a tabular format.

05/12/24 BIG DATA ANALYTICS 3


Benefits
•The ability to absorb information quickly, improve insights and make faster decisions
with less mistakes.
•An increased understanding of the next steps that must be taken to improve;
•An improved ability to maintain the audience’s interest with information they can
understand;
•It could be as simple as line charts, histograms and pie charts or a bit complex like
scatter plot, heat maps, tree maps, etc.
•Visualization of big data can also be done in 3-Dimensional graphs, based on the use
case.

05/12/24 BIG DATA ANALYTICS 4


05/12/24 BIG DATA ANALYTICS 5
Why is Data Visualization important in Big Data?
•The best part of big data visualization is capable of capturing data sets in the visual
format without loss of accuracy. One can control the factors like accuracy, precision,
level of aggregation that is required to serve the purpose.
•Another major benefit of visualization is the ability to show all the information in a
single place.
•It enables you to create dashboards and reports, which are packed with insights, that
can be shared across the organization.
•We need visualization of data in every industry - airline, IoT, energy, media and
entertainment, automotive, sports, manufacturing, the list is endless.

05/12/24 BIG DATA ANALYTICS 6


Type of Big Data Visualization (1)
•1. Line charts
◦ Also called a line graph or line plot is
a common chart.
◦ It is used to represent changes in one
variable against another, typically the
time.
◦ The data points are connected by
lines.
◦ It is used for identifying trends and
relationships between two variables.

05/12/24 BIG DATA ANALYTICS 7


Type of Big Data Visualization (2)
•2. Histograms
◦ It is used to represent the frequency distribution of
data.
◦ It groups data into logical ranges and depicts the
count of how many data points fall into each of those
ranges.
◦ It allows one to understand the nature of frequency
distributions.
◦ The distribution may be categorized as symmetric,
right-skewed and left skewed.
Type of Big Data Visualization (3)
•3. Bar chart
◦ Also called a bar graph, is used for depicting categorical data with rectangular strips/bars.
◦ The length of the bars shows the value or quantity of a variable.
◦ The bars are horizontal.
•4. Column Chart
◦ They are a straightforward, time-tested method of comparing several collections of data. A column
chart may be used to track data sets across time.
Type of Big Data Visualization (4)
•4. Pie charts
◦ Pie chart depicts the
information in the form of
“pie slices”.
◦ The “slices” are in proportion
to the relative sizes of data.

05/12/24 BIG DATA ANALYTICS 10


Type of Big Data Visualization (5)
•5. Heat Maps
◦ A heat map uses two-dimensional
representation of data in which
colors represent the values or
ranges.
◦ It provides a quick visual summary
of information.
◦ Uses colors to denote values; great
for seeing trends in huge datasets
Type of Big Data Visualization (6)
•6. Scatter plot
◦ It uses dots/points to show values for numeric
variables.
◦ The position of the dots against both the axes
indicates the value of that particular data point.
Type of Big Data Visualization (6)
•7. Tree map
◦ It is a visualization composed of nested
rectangles.
◦ These rectangles represent certain categories
within a selected dimension and are ordered in a
hierarchy, or “tree.”
◦ Quantities and patterns can be compared and
displayed in a limited chart space.
◦ Tree maps represent part to whole relationships.

05/12/24 BIG DATA ANALYTICS 13


Other types
•8. Bubble Chart
◦ A variant of the scatter plot where the size and color of the bubbles, which represent the data points,
provide extra information, are used to depict the data points as dots.
•9. Funnel Chart
◦ To illustrate a sequential process from top to bottom, a funnel
chart's principal purpose is to represent it graphically.
As the process flows down, the amount generally decreases,
making the data set at the top of the process greater than the bottom.

05/12/24 BIG DATA ANALYTICS 14


How to Select the Appropriate Graph or Chart for Your Data?
•Purpose
◦ What are you trying to visualize? Are you attempting to demonstrate contrasts, patterns, or
connections in your data?
•Type of Data
◦ What kind of data do you have? Is it a numerical or category list? Both continuous and discrete?
This will aid in choosing the best types of data visualization charts.
•Context
◦ What context does your data come from? Is it recent or historical? Local or worldwide? This will
enable you to choose the proper scale and coverage for your visualization.
•Selecting an appropriate visualization method is crucial. The type of visualization should align
with the nature of the data and the insights you want to communicate.

05/12/24 BIG DATA ANALYTICS 15


Best Practices for Visualizing Big Data
•Representing vast and complex datasets: ensure clarity and usability of the visualizations and also enhances the
understanding and actionable insights derived from big data.
•Simplify and Aggregate: Big data sets can be overwhelming and noisy. Simplification via aggregation can help. Aggregate
data to reduce complexity· Show trends or averages instead of individual data points. Use clustering to group similar data
and reduce the number of elements displayed.
•Interactive Elements: Interactivity allows users to explore and manipulate big data in ways static visualizations cannot.
Dynamic querying where users can customize data or variables to be visualized.
•Use Color Wisely: Color is a powerful tool but can also lead to confusion if misused. Use consistent color schemes across
multiple visualizations to maintain clarity. Limit the number of colors used to avoid visual overload.
•Contextualize the Data: Data without context can be misleading. Provide metadata that explains the data source,
collection method, and any modifications made to the data. Use annotations and legends to guide interpretation

05/12/24 BIG DATA ANALYTICS 17


Advantages of Data Visualization
•Enhanced Comparison: Visualizing performances of two elements or scenarios streamlines analysis, saving time
compared to traditional data examination.
•Improved Methodology: Representing data graphically offers a superior understanding of situations, exemplified by tools
like Google Trends illustrating industry trends in graphical forms.
•Efficient Data Sharing: Visual data presentation facilitates effective communication, making information more digestible
and engaging compared to sharing raw data.
•Sales Analysis: Data visualization aids sales professionals in comprehending product sales trends, identifying influencing
factors through tools like heat maps, and understanding customer types, geography impacts, and repeat customer behaviors.
•Identifying Event Relations: Discovering correlations between events helps businesses to understand external factors
affecting their performance, such as online sales surges during festive seasons.
•Exploring Opportunities and Trends: Data visualization empowers business leaders to uncover patterns and
opportunities within vast datasets, enabling a deeper understanding of customer behaviors and insights into emerging
business trends.

05/12/24 BIG DATA ANALYTICS 18


Challenges in Big Data Visualization
• Complex data: Big data often involves complex relationships and multiple dimensions. Big data can not visualize with the traditional method
as the traditional method has many limitations. Creating visualizations that accurately represent this complexity without overwhelming the user
is a delicate balance.
• Data Quality: Ensuring data accuracy, completeness, and consistency is critical for effective visualization. Poor data quality can lead to
misleading insights and erroneous decision-making.
• Scalability: Visualizing large datasets requires tools and techniques that can handle high volumes of data without compromising performance.
◦ Perceptual Scalability: Too many visualizations are not always possible to fit on a single screen.
◦ Real-time Scalability: It is always expected that all information should be real-time information, but it is hardly possible as processing the dataset
needs time.
◦ Interactive scalability: Interactive data visualization help to understand what is inside the datasets, but as its volume increases exponentially,
visualizing the datasets take a long time. But the challenge is sometimes, and the system may freeze or crash while trying to visualize the datasets.
• Integration: Integrating data from various sources into a cohesive visualization can be challenging. Ensuring compatibility and seamless
integration is crucial for comprehensive data analysis.

05/12/24 BIG DATA ANALYTICS 19


Data Visualization tools
•Tableau
•Infogram
•ChartBlocks
•D3.js
•Google Charts
•Fusion Charts
•Chart.js

05/12/24 BIG DATA ANALYTICS 20

You might also like