SlideShare a Scribd company logo
Data Analytics
Data Analytics
Accumulation of raw data captured from various sources (i.e.
discussion boards, emails, exam logs, chat logs in e-learning
systems) can be used to identify fruitful patterns and relationships.
Exploratory visualization – uses exploratory data analytics by
capturing relationships that are perhaps unknown or at least less
formally formulated
Confirmatory visualization - theory-driven
 2012 - Dr. Yair Levy and Dr. Michelle M. Ramim – Chais 2012 Conference, February 16, 2012.
What Can Be Learned From Data
Sets?
Business Intelligence (BI) or Business Analytics (BA)
Data analytics is an emerging technique that dives into a data set
without prior set of hypotheses
The data derive meaningful trends or intriguing findings that were not
previously seen or empirically validated.
Data analytics enables quick decisions or help change policies due to
trends observed
 2012 - Dr. Yair Levy and Dr. Michelle M. Ramim – Chais 2012 Conference, February 16, 2012.
The Power of Data Analytics
Source: http://mobile.informationweek.com/80256/show/488f5c42fd3f92317e5ac29faeee033e/
 2012 - Dr. Yair Levy and Dr. Michelle M. Ramim – Chais 2012 Conference, February 16, 2012.
Data Analytics vs. Statistical Analysis
Statistical Analysis
Utilizes statistical
and/or mathematical
techniques
Used based on
theoretical foundation
Seeks to identify a
significant level to
address hypotheses
or RQs
Data Analytics
Utilizes data mining
techniques
Identifies inexplicable
or novel
relationships/trends
Seeks to visualize the
data to allow the
observation of
relationships/trends
The Foundation for Analytics
Difference between social media analysis and traditional business
intelligence is the competitive information highly available on social
media.
Historically, companies have mostly dealt with their own data, and
eventually added studies based on external data published by
research organizations and so forth. Even in the digital age, web
site analysis and digital advertising still binds most of the useful
data to its owners, and being competitive has never been an easy
task.
Competitive intelligence and other aspects of social media data
and analytics create a new context for social media and a data-
driven strategy.
Social Media Data Sources: Offline
and Online

Offline originated data - Data that has been generated
with no connection to the Internet, and then registered into
a system, which may be accessed via the Internet later.
Examples include physical retail, printed press, live
events, telephone marketing and customer support,
traditional television audience measurements, and so forth

Online originated data - Created from systems
connected to the Internet. Examples include web sites, e-
commerce, media streaming, e-mail, mobile applications,
social media, online devices, and so forth.
Within these systems we might also have
different points of data generation. The social
media post itself is one source, for example,
and the comments under that same social
media post comes from a different source. Now
think of everything that happens on social
media, with all the different types of interactions
and quick sharing of content, there are many
data sources within each social media network.
Defining Social Media Data
Within online generated data, only a portion of it
is considered truly social media data. There is
always a gray area of debate on what social
media is and what it is not. But to make it
simple, we can define social media data as
data generated within a self-named social
media platform
Data Sources in social Media
Channels

Create a profile.

Publish content.

Praise or react to content (e.g., likes, favorites, etc.).

Comment on content.

Share content.

Create groups and content only available to the group.

Send direct messages and chat with other users.

Connect to another profile (as a friend, follower, etc.).

Purchase products and perform transactions .
Estimated Data sources and Factual
Data Sources

It is important to understand when data sources
represent a fact, and when they represent a possible
fact or an estimate. Especially when going into paid
media and content promotion, we come across many
estimated, or sometimes questionable, sources.

These estimated metrics include views, impressions,
and reach, for example, which provide the number of
times a user has potentially seen a certain piece of
content or advertising.

With a large amount of high-quality data, we
can reach very high-quality estimates, to the
point where we get our estimates correct and
statistically validated every single time. That is
the point where machines take over the
decision-making process, and automation kicks
in to take care of such tasks.
Public and Private Data

Public data is what anyone can see when
navigating a social platform.

Private data is what only the owner of the
social media profile can see.

If we are looking for competitive analytics and social media
benchmarking, which is really a must-do for optimizing
social strategy, we need public data for that. Public-level
data allows us to see the performance of our competitors’
social media channels, or any of the available information
that does not belong to us.

Two points are important to remember:
1. What is public can be easily compared.
2. What is private, if it does not belong to us, can only
be estimated.

Some services, for example, offer to detect paid
posts on Facebook. Paid information is private,
so it is not available if we don’t own the data.
Paid detection offers an estimate of it based on
a machine learning process.
Data Gathering in Social Media
Analytics
Data can be gathered in two ways when it comes
to pulling human-action

Via API (application programming interface)

Web crawling or scraping
API : Application Programming
Interface

An API is a structured channel of access into an
application. It allows a programmer to see a
clear structure of the information that is stored
in the application. This structure points the
programmer straight to the data that he or she
is looking for. Facebook offers API access to its
data. A programmer can look into Facebook
and request any specific information; for
example, “total likes for a certain page post”
Web Crawling or Scraping

Everything we see on the Internet has a source code driving it—a set of
instructions for all the systems connecting and interacting with the data. A
good-looking web site full of images has a set of hidden instructions telling
the browser how to display all that information.

A programmer can tap into the source code and then crawl for any specific
information needed. Other terms are also used to describe this process, such
as scraping .Web crawling/scraping is a very fragile and unstable way of
gathering data, because when anything changes on the web site, the source
code is changed, and the programmer has to reprogram a new way to crawl
for the information.

It is also likely that crawling can bump into privacy regulations from web
sites; the owners of these web sites will not like that very much. So
whenever possible, API access is the way to go. APIs are what most
analytics platforms rely upon.

More Related Content

PPTX
L3 SOCIAL MEDIA ANALYTICS_ AN OVERVIEW.pptx
PDF
Lecture3 business intelligence
PPTX
Social media data analytics
PDF
Mighty Guides- Data Disruption
PDF
Social media analytics - 5 key trends
PPTX
Social business intelligence an introduction
PPTX
Social media data analysis
PPTX
Introduction of Social Computing-Unit 1.pptx
L3 SOCIAL MEDIA ANALYTICS_ AN OVERVIEW.pptx
Lecture3 business intelligence
Social media data analytics
Mighty Guides- Data Disruption
Social media analytics - 5 key trends
Social business intelligence an introduction
Social media data analysis
Introduction of Social Computing-Unit 1.pptx

Similar to SMA-Unit-I: The Foundation for Analytics (20)

PPT
Social Media and Text Analytics for Business.ppt
PDF
Social media Enabling Smart Decisions
PDF
Big data overview
PDF
Social-Media-Analytics-Enabling-Intelligent-Real-Time-Decision-Making
PPTX
The power of social media anlaytics
PDF
Soneta. Social Network Analyzer
PDF
DATACTIF SoNetA. BIG DATA ANALYTIS
PPTX
Social media & big data
PDF
Introduction to Data Analytics, AKTU - UNIT-1
PDF
No BS Monitoring and Measurement
PPTX
Marketing in a social age travel updated (feb)
PPTX
727325165-Unit-1-Data-Analytics-PPT-1.pptx
PDF
social media analytics.pdf
PDF
Maximize ROI with Social Media Data Mining Services.pdf
PPTX
Analytics Best Practice for the Travel Industry
PPTX
Social Media Data Collection & Analysis
PPT
BAQMaR - Conference DM
PPTX
Spark Social Media
PPTX
Interactive media analytics presentation
PPTX
Interactive media analytics presentation
Social Media and Text Analytics for Business.ppt
Social media Enabling Smart Decisions
Big data overview
Social-Media-Analytics-Enabling-Intelligent-Real-Time-Decision-Making
The power of social media anlaytics
Soneta. Social Network Analyzer
DATACTIF SoNetA. BIG DATA ANALYTIS
Social media & big data
Introduction to Data Analytics, AKTU - UNIT-1
No BS Monitoring and Measurement
Marketing in a social age travel updated (feb)
727325165-Unit-1-Data-Analytics-PPT-1.pptx
social media analytics.pdf
Maximize ROI with Social Media Data Mining Services.pdf
Analytics Best Practice for the Travel Industry
Social Media Data Collection & Analysis
BAQMaR - Conference DM
Spark Social Media
Interactive media analytics presentation
Interactive media analytics presentation
Ad

More from DEEPAK948083 (20)

PPT
Basics of RFID Technologyddscccccddd.ppt
PPT
turban_ch07ch07ch07ch07ch07ch07dss9e_ch07.ppt
PPT
introAdhocRoutingRoutingRoutingRouting-new.ppt
PPT
SensorSensorSensorSensorSensorSensor.ppt
PPT
Chapter1_IntroductionIntroductionIntroduction.ppt
PPT
introDMintroDMintroDMintroDMintroDMintroDM.ppt
PPT
lect1lect1lect1lect1lect1lect1lect1lect1.ppt
PPTX
Chchchchchchchchchchchchchchchchc 11.pptx
PPT
applicationapplicationapplicationapplication.ppt
PPT
MOBILE & WIRELESS SECURITY And MOBILE & WIRELESS SECURITY
PPTX
datastructureppt-190327174340 (1).pptx
PPTX
5virusandmaliciouscodechapter5-130716024935-phpapp02-converted.pptx
PPT
Lect no 13 ECC.ppt
PPTX
block ciphermodes of operation.pptx
PPT
Lect no 13 ECC.ppt
PPTX
unit1Intro_final.pptx
PPT
whitman_ch04.ppt
PPT
lesson333.ppt
PPT
ICS PPT Unit 4.ppt
PPTX
stack-Intro.pptx
Basics of RFID Technologyddscccccddd.ppt
turban_ch07ch07ch07ch07ch07ch07dss9e_ch07.ppt
introAdhocRoutingRoutingRoutingRouting-new.ppt
SensorSensorSensorSensorSensorSensor.ppt
Chapter1_IntroductionIntroductionIntroduction.ppt
introDMintroDMintroDMintroDMintroDMintroDM.ppt
lect1lect1lect1lect1lect1lect1lect1lect1.ppt
Chchchchchchchchchchchchchchchchc 11.pptx
applicationapplicationapplicationapplication.ppt
MOBILE & WIRELESS SECURITY And MOBILE & WIRELESS SECURITY
datastructureppt-190327174340 (1).pptx
5virusandmaliciouscodechapter5-130716024935-phpapp02-converted.pptx
Lect no 13 ECC.ppt
block ciphermodes of operation.pptx
Lect no 13 ECC.ppt
unit1Intro_final.pptx
whitman_ch04.ppt
lesson333.ppt
ICS PPT Unit 4.ppt
stack-Intro.pptx
Ad

Recently uploaded (20)

PDF
Topic-5-Human-Dignity-and-Rights-PPT.pdf
DOCX
Finished Fuel Storage Dependable Welded Steel Tanks.docx
PDF
Session 9 Perspectives by Africa’s academia, research, policymakers, and priv...
PPTX
My Glocal Green Election Initiatives Punjab, Mumbai and Gujarat
PDF
cbam presentation.pdf_for exporters exporting tpo Europe
PPTX
Nitrogen deficiency in plants final.pptx
PPTX
challenges and limitations in biofertilizers .pptx
PDF
2025 MyRWA Wicked Cool Mystic Newsletter.pdf
PPTX
carbon footprint, emissioncontrol and carbon tax
PDF
CME-I_Week 1_ Units and Conversion_Suresh B S.pdf
PPTX
Session8a AR6 Findings Working Group I Vice-Chair Nana Ama Browne Klutse
PPTX
BENJAMIN JOSE M. LEGASPI POWERPOINT.pptx
PPTX
Great Molasses Flood or boston molasses food
DOCX
Welded Steel Tanks Practical Storage for Used Motor Oil.docx
DOCX
Marine Fuel Bunkering Tanks Reliable Welded Steel Solutions.docx
PPTX
DEFENSE MECH AND HOST RESPonse to plaquetx
DOCX
Aviation Fuel Storage Precision Welded Steel Tanks.docx
PPTX
Session 9: Panel 3 - Sports for Climate Change
PPTX
EARTHQUAKE (1).pptx natural disaster notes
PPTX
Plant Diseases and Deficiencies Biology Presentation in a Colorful Illustrati...
Topic-5-Human-Dignity-and-Rights-PPT.pdf
Finished Fuel Storage Dependable Welded Steel Tanks.docx
Session 9 Perspectives by Africa’s academia, research, policymakers, and priv...
My Glocal Green Election Initiatives Punjab, Mumbai and Gujarat
cbam presentation.pdf_for exporters exporting tpo Europe
Nitrogen deficiency in plants final.pptx
challenges and limitations in biofertilizers .pptx
2025 MyRWA Wicked Cool Mystic Newsletter.pdf
carbon footprint, emissioncontrol and carbon tax
CME-I_Week 1_ Units and Conversion_Suresh B S.pdf
Session8a AR6 Findings Working Group I Vice-Chair Nana Ama Browne Klutse
BENJAMIN JOSE M. LEGASPI POWERPOINT.pptx
Great Molasses Flood or boston molasses food
Welded Steel Tanks Practical Storage for Used Motor Oil.docx
Marine Fuel Bunkering Tanks Reliable Welded Steel Solutions.docx
DEFENSE MECH AND HOST RESPonse to plaquetx
Aviation Fuel Storage Precision Welded Steel Tanks.docx
Session 9: Panel 3 - Sports for Climate Change
EARTHQUAKE (1).pptx natural disaster notes
Plant Diseases and Deficiencies Biology Presentation in a Colorful Illustrati...

SMA-Unit-I: The Foundation for Analytics

  • 1. Data Analytics Data Analytics Accumulation of raw data captured from various sources (i.e. discussion boards, emails, exam logs, chat logs in e-learning systems) can be used to identify fruitful patterns and relationships. Exploratory visualization – uses exploratory data analytics by capturing relationships that are perhaps unknown or at least less formally formulated Confirmatory visualization - theory-driven
  • 2.  2012 - Dr. Yair Levy and Dr. Michelle M. Ramim – Chais 2012 Conference, February 16, 2012. What Can Be Learned From Data Sets? Business Intelligence (BI) or Business Analytics (BA) Data analytics is an emerging technique that dives into a data set without prior set of hypotheses The data derive meaningful trends or intriguing findings that were not previously seen or empirically validated. Data analytics enables quick decisions or help change policies due to trends observed
  • 3.  2012 - Dr. Yair Levy and Dr. Michelle M. Ramim – Chais 2012 Conference, February 16, 2012. The Power of Data Analytics Source: http://mobile.informationweek.com/80256/show/488f5c42fd3f92317e5ac29faeee033e/
  • 4.  2012 - Dr. Yair Levy and Dr. Michelle M. Ramim – Chais 2012 Conference, February 16, 2012. Data Analytics vs. Statistical Analysis Statistical Analysis Utilizes statistical and/or mathematical techniques Used based on theoretical foundation Seeks to identify a significant level to address hypotheses or RQs Data Analytics Utilizes data mining techniques Identifies inexplicable or novel relationships/trends Seeks to visualize the data to allow the observation of relationships/trends
  • 5. The Foundation for Analytics Difference between social media analysis and traditional business intelligence is the competitive information highly available on social media. Historically, companies have mostly dealt with their own data, and eventually added studies based on external data published by research organizations and so forth. Even in the digital age, web site analysis and digital advertising still binds most of the useful data to its owners, and being competitive has never been an easy task. Competitive intelligence and other aspects of social media data and analytics create a new context for social media and a data- driven strategy.
  • 6. Social Media Data Sources: Offline and Online  Offline originated data - Data that has been generated with no connection to the Internet, and then registered into a system, which may be accessed via the Internet later. Examples include physical retail, printed press, live events, telephone marketing and customer support, traditional television audience measurements, and so forth  Online originated data - Created from systems connected to the Internet. Examples include web sites, e- commerce, media streaming, e-mail, mobile applications, social media, online devices, and so forth.
  • 7. Within these systems we might also have different points of data generation. The social media post itself is one source, for example, and the comments under that same social media post comes from a different source. Now think of everything that happens on social media, with all the different types of interactions and quick sharing of content, there are many data sources within each social media network.
  • 8. Defining Social Media Data Within online generated data, only a portion of it is considered truly social media data. There is always a gray area of debate on what social media is and what it is not. But to make it simple, we can define social media data as data generated within a self-named social media platform
  • 9. Data Sources in social Media Channels  Create a profile.  Publish content.  Praise or react to content (e.g., likes, favorites, etc.).  Comment on content.  Share content.  Create groups and content only available to the group.  Send direct messages and chat with other users.  Connect to another profile (as a friend, follower, etc.).  Purchase products and perform transactions .
  • 10. Estimated Data sources and Factual Data Sources  It is important to understand when data sources represent a fact, and when they represent a possible fact or an estimate. Especially when going into paid media and content promotion, we come across many estimated, or sometimes questionable, sources.  These estimated metrics include views, impressions, and reach, for example, which provide the number of times a user has potentially seen a certain piece of content or advertising.
  • 11.  With a large amount of high-quality data, we can reach very high-quality estimates, to the point where we get our estimates correct and statistically validated every single time. That is the point where machines take over the decision-making process, and automation kicks in to take care of such tasks.
  • 12. Public and Private Data  Public data is what anyone can see when navigating a social platform.  Private data is what only the owner of the social media profile can see.
  • 13.  If we are looking for competitive analytics and social media benchmarking, which is really a must-do for optimizing social strategy, we need public data for that. Public-level data allows us to see the performance of our competitors’ social media channels, or any of the available information that does not belong to us.  Two points are important to remember: 1. What is public can be easily compared. 2. What is private, if it does not belong to us, can only be estimated.
  • 14.  Some services, for example, offer to detect paid posts on Facebook. Paid information is private, so it is not available if we don’t own the data. Paid detection offers an estimate of it based on a machine learning process.
  • 15. Data Gathering in Social Media Analytics Data can be gathered in two ways when it comes to pulling human-action  Via API (application programming interface)  Web crawling or scraping
  • 16. API : Application Programming Interface  An API is a structured channel of access into an application. It allows a programmer to see a clear structure of the information that is stored in the application. This structure points the programmer straight to the data that he or she is looking for. Facebook offers API access to its data. A programmer can look into Facebook and request any specific information; for example, “total likes for a certain page post”
  • 17. Web Crawling or Scraping  Everything we see on the Internet has a source code driving it—a set of instructions for all the systems connecting and interacting with the data. A good-looking web site full of images has a set of hidden instructions telling the browser how to display all that information.  A programmer can tap into the source code and then crawl for any specific information needed. Other terms are also used to describe this process, such as scraping .Web crawling/scraping is a very fragile and unstable way of gathering data, because when anything changes on the web site, the source code is changed, and the programmer has to reprogram a new way to crawl for the information.  It is also likely that crawling can bump into privacy regulations from web sites; the owners of these web sites will not like that very much. So whenever possible, API access is the way to go. APIs are what most analytics platforms rely upon.