SlideShare a Scribd company logo
Data Analytics
Dr. Vala Ali Rohani
Vala@um.edu.my
VRohani@gmail.com
My Bio Data
• Postdoctoral scholar in Social Network Analysis
• PhD in Software Engineering (Recommender Systems)
• Social Network Analysis from University of Michigan
Professional Certificates:
• University Lecturer for more than 10 years
• Mining Massive Datasets from Stanford University
• Pattern Discovery in Data Mining from Illinois University
• Process Mining from Eindhoven University of Technology
• Statistical Analysis using SPSS & SAS from University of Malaya
• MongoDB for DBAs from MongoDB
Data Analytics by Vala Ali Rohani
Presentation outline
Data Science & Big Data
Social Network Analysis
Process Mining
Market Basket Analysis
Data Analytics by Vala Ali Rohani
Data Analytics by Vala Ali Rohani
Data Analytics
&
Big Data
Domain Terminology
Data Analytics by Vala Ali Rohani
Data Science & Big Data
• Data Analysis, Data Mining, Machine Learning and Mathematical Modeling are
tools: means towards an end.
• Analytics, Business Intelligence, Econometrics and Artificial Intelligence are
application areas: domains that use the tools above (and others) to produce results
within its subject.
• Statistics is a branch of Mathematics providing theoretical and practical support to the
above tools.
• Data Science is a catch-all term to describe using those all tools to provide answers in
those all areas (and also in others), specially when dealing with Big Data
http://www.quora.com/What-is-the-difference-between-Data-Analytics-Data-Analysis-Data-Mining-Data-Science-Machine-Learning-
and-Big-Data-1
Data Analytics by Vala Ali Rohani
Data is the New Oil!
In the last 10 minutes we generated more data than from prehistoric times until 2003!
Data Science & Big Data
Data Analytics by Vala Ali Rohani
A data scientist is able to collect, analyze, and interpret data
from a variety of sources (social interaction, business
processes, cyber-physical systems).
Turning data into value!
Data Science & Big Data
Data Analytics by Vala Ali Rohani
Four generic data science questions:
1. What happened?
2. Why did it happen?
3. What will happen?
4. What is the best that can happen?
Data Science & Big Data
Data Analytics by Vala Ali Rohani
Data Science & Big Data
Data Analytics by Vala Ali Rohani
Data Science & Big Data
Big data is a broad term for data sets so large or complex that traditional data processing
applications are inadequate.
http://en.wikipedia.org/wiki/Big_data
How Much Data?
1 PB = 1000000000000000B = 1015 bytes = 1000terabytes.
• Google processes 20 PB a day (2008)
• Facebook has 2.5 PB of user data + 15 TB/day (4/2009)
• Each engine of Boeing 747 generates 20 TB of information per hour
Data Analytics by Vala Ali Rohani
Data Science & Big Data
Data Analytics by Vala Ali Rohani
Data Science & Big Data
Some Big Data Theories and Techniques
Map-Reduce
Market Basket Analysis
Pattern Discovery
Social Network Analysis
Process Mining
Data Analytics by Vala Ali Rohani
Social Network
Analysis
(SNA)
Data Analytics by Vala Ali Rohani
Social Network Analysis (SNA)
Every thing is connected
When you sell items …
When you receive customer calls ... When you make a contract …
When you ship orders …
Data Analytics by Vala Ali Rohani
Social Network Analysis (SNA)
What are Networks?
• Networks are sets of nodes connected by edges
“Network” ≡ “Graph”
node
edge
Data Analytics by Vala Ali Rohani
Social Network Analysis (SNA)
What is SNA?
SNA (Social Network Analysis) is the mapping and measuring of relationships and
flows between people, groups, organizations, computers, URLs, and other
connected entities
SNA provides both a visual and a
mathematical analysis of human
relationships.
Data Analytics by Vala Ali Rohani
Social Network Analysis (SNA)
Why do we need Social Network Analysis?
• Are nodes connected through the network?
• How far apart are they?
• Are some nodes more important due to their position in the
network?
• How will be the patterns for information diffusion?
• Is the network composed of communities?
Data Analytics by Vala Ali Rohani
Social Network Analysis (SNA)
Now,
let’s see some samples of SNA …
Data Analytics by Vala Ali Rohani
Social Network Analysis (SNA)
Internet
structure of the Internet at the level of autonomous systems. Data source: Mark
Newman http://www-personal.umich.edu/~mejn/netdata/.
Data Analytics by Vala Ali Rohani
Social Network Analysis (SNA)
Political Blogs
2004 United States Presidential Election Network
Liberals
Conservatives
Data Analytics by Vala Ali Rohani
Social Network Analysis (SNA)
Facebook Friendship Network
Data Analytics by Vala Ali Rohani
Social Network Analysis (SNA)
SNA in Organizations (or ONA)
Data Analytics by Vala Ali Rohani
Social Network Analysis (SNA)
SNA Metrics :
Degree
Betweenness
Closeness
Data Analytics by Vala Ali Rohani
Social Network Analysis (SNA)
SNA Main Centrality Metrics
Degree
The number of direct connections that a node has
𝑑𝑖 =
𝑗 𝑎𝑖𝑗
(𝑛 − 1)
SNA Main Centrality Metrics
Betweenness
Betweenness centrality identifies an entity's position within a network in terms of its
ability to make connections to other pairs or groups in a network.
CB (i) = gjk (i)/gjk
j<k
å
CB
'
(i) = CB (i )/[(n -1)(n -2)/2]
Data Analytics by Vala Ali Rohani
Social Network Analysis (SNA)
SNA Main Centrality Metrics
Closeness
Closeness centrality measures how quickly an entity can access more entities in a
network.
Cc (i) = d(i, j)
j=1
N
å
é
ë
ê
ê
ù
û
ú
ú
-1
CC
'
(i) = (CC (i))/(N -1)
Data Analytics by Vala Ali Rohani
Social Network Analysis (SNA)
Data Analytics by Vala Ali Rohani
Social Network Analysis (SNA)
SNA Tools:
NodeXL
Data Analytics by Vala Ali Rohani
Social Network Analysis (SNA)
SNA Tools:
Gephi
Data Analytics by Vala Ali Rohani
Social Network Analysis (SNA)
SNA Tools:
UCINET
Key nodes in Organization
(from ONA view)
Data Analytics by Vala Ali Rohani
Social Network Analysis (SNA)
Data Analytics by Vala Ali Rohani
Social Network Analysis (SNA)
Find a node that has high betweenness but
low degree
Data Analytics by Vala Ali Rohani
Social Network Analysis (SNA)
Find a node that has low betweenness but
high degree
Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani
Social Network Analysis (SNA)
Data Analytics by Vala Ali Rohani
Process Mining
Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani
Process Mining
https://www.coursera.org/course/procmin
Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani
Process Mining
https://www.coursera.org/course/procmin
Process mining is the missing link between model-based process analysis and
data-oriented analysis techniques.
Process mining seeks the confrontation between event data (i.e., observed behavior)
and process models (hand-made or discovered automatically).
Some example applications include:
• Analyzing treatment processes in hospitals
• Improving customer service processes
• Understanding the browsing behavior of customers using a booking site
• Analyzing failures of a baggage handling system
What is Process Mining?
Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani
Process Mining
https://www.coursera.org/course/procmin
• What is the process that people really follow?
• Where are the bottlenecks in the studied process?
• Where do people (or machines) deviate from the expected or
idealized process?
• What are the "highways" in my process?
• What factors are influencing a bottleneck?
• Can we predict problems (delay, deviation, risk, etc.) for
running cases?
• Can we recommend some improvements for main process of the
organization?
• How to redesign the process / organization / machine?
Process mining use cases
Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani
Process Mining
https://www.coursera.org/course/procmin
Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani
Process Mining
https://www.coursera.org/course/procmin
Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani
Process Mining
https://www.coursera.org/course/procmin
Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani
Process Mining
https://www.coursera.org/course/procmin
Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani
Process Mining
https://www.coursera.org/course/procmin
Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani
Process Mining
Some Examples of Real Discovered Processes
Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani
Process Mining
Some Examples of Real Discovered Processes
Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani
Process Mining
Some Examples of Real Discovered Processes
Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani
Process Mining
Some Examples of Real Discovered Processes
Data Analytics by Vala Ali Rohani
Market Basket
Analysis
Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani
Market Basket Analysis
Introduction
Market Basket Analysis (MBA) is a data mining technique which is widely used in the
consumer package goods (CPG) industry to identify which items are purchased together
and, more importantly, how the purchase of one item affects the likelihood of another
item being purchased.
Bill Qualls, First Analytics, Raleigh, NC, Introduction to Market Basket Analysis
Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani
Market Basket Analysis
SALES TRANSACTIONS
Bill Qualls, First Analytics, Raleigh, NC, Introduction to Market Basket Analysis
Our imaginary store sales the following items: bananas, bologna, bread, buns, butter, cereal,
cheese, chips, eggs, hotdogs, mayo, milk, mustard, oranges, pickles, and soda. We have
recorded 20 sales transactions as follows:
Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani
Market Basket Analysis
MBA Theories:
Bill Qualls, First Analytics, Raleigh, NC, Introduction to Market Basket Analysis
Support for itemset I = the number of baskets containing all items in I.
Given a support threshold s, sets of items that appear in at least s baskets are called
frequent itemsets.
Association rules are If‐then rules about the contents of baskets.
Confidence of this association rule is the probability of j given i1,…,ik.
Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani
Market Basket Analysis
MBA Theories:
Bill Qualls, First Analytics, Raleigh, NC, Introduction to Market Basket Analysis
Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani
Market Basket Analysis
Market Basket Example
Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani
Thank you

More Related Content

PDF
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
PPTX
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
PDF
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
PPTX
Big Data Analytics
PDF
Data Catalog for Better Data Discovery and Governance
PDF
DI&A Slides: Data Lake vs. Data Warehouse
PDF
DAS Slides: Building a Data Strategy — Practical Steps for Aligning with Busi...
PDF
Introduction To Data Science
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
Big Data Analytics
Data Catalog for Better Data Discovery and Governance
DI&A Slides: Data Lake vs. Data Warehouse
DAS Slides: Building a Data Strategy — Practical Steps for Aligning with Busi...
Introduction To Data Science

What's hot (20)

PDF
8 Steps to Creating a Data Strategy
PPTX
1. Data Analytics-introduction
PPTX
introduction to data science
PPT
Introduction to Data Mining
PPTX
Tableau free tutorial
PPTX
Introduction to Data Engineering
PDF
Data Analytics PowerPoint Presentation Slides
PDF
Summary introduction to data engineering
PPTX
Data visualization
PDF
Building a Data Strategy – Practical Steps for Aligning with Business Goals
PPTX
Data engineering
PDF
DAS Slides: Data Governance - Combining Data Management with Organizational ...
PDF
Learn to Use Databricks for Data Science
PDF
Data Visualization With Tableau | Edureka
PDF
Data Architecture Strategies
PPTX
Tableau Visual analytics complete deck 2
PDF
Data Engineering Basics
PDF
Data Quality Best Practices
PDF
Data-Ed Webinar: Data Governance Strategies
8 Steps to Creating a Data Strategy
1. Data Analytics-introduction
introduction to data science
Introduction to Data Mining
Tableau free tutorial
Introduction to Data Engineering
Data Analytics PowerPoint Presentation Slides
Summary introduction to data engineering
Data visualization
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Data engineering
DAS Slides: Data Governance - Combining Data Management with Organizational ...
Learn to Use Databricks for Data Science
Data Visualization With Tableau | Edureka
Data Architecture Strategies
Tableau Visual analytics complete deck 2
Data Engineering Basics
Data Quality Best Practices
Data-Ed Webinar: Data Governance Strategies
Ad

Viewers also liked (20)

PPTX
Hadoop on Windows 8
PDF
User behavior model & recommendation on basis of social networks
PPT
PDF
the near future of tourism services based on digital traces
PPT
R5 What Is The Impact Of Urban Activities
PPTX
Unmetric facebook analysis
PDF
Designing The Social In
PPTX
Urban Agents and Citizen Apps
PPTX
Urban Impact Framework
PPT
Multidimensional Patterns of Disturbance in Digital Social Networks
PPT
CSCW 2011 Talk on "Activity Analysis"
PPTX
User Behaviour Pattern Recognition On Twitter Social Network
PPT
Points of distribution
PDF
Introduction to power laws
PPTX
Zipf distribution
PDF
Impact of Urban Logistics of Commercial Vehicles
PPTX
Distinguish between chinese urban planning and american urban planning
PPT
Intro To Power Laws (March 2008)
PDF
U-Tool: A Urban-Toolkit for enhancing city maps through citizens’ activity
Hadoop on Windows 8
User behavior model & recommendation on basis of social networks
the near future of tourism services based on digital traces
R5 What Is The Impact Of Urban Activities
Unmetric facebook analysis
Designing The Social In
Urban Agents and Citizen Apps
Urban Impact Framework
Multidimensional Patterns of Disturbance in Digital Social Networks
CSCW 2011 Talk on "Activity Analysis"
User Behaviour Pattern Recognition On Twitter Social Network
Points of distribution
Introduction to power laws
Zipf distribution
Impact of Urban Logistics of Commercial Vehicles
Distinguish between chinese urban planning and american urban planning
Intro To Power Laws (March 2008)
U-Tool: A Urban-Toolkit for enhancing city maps through citizens’ activity
Ad

Similar to Data Analytics (20)

PDF
Agile data science
PPTX
Big Data Forum - Phoenix
PPTX
Jisc learning analytics MASHEIN Jan 2017
PPTX
Business Intelligence and Big Data in Cloud
PPT
Data mining
PDF
Data Science Business Analytics Sneha Kumari K K Tripathy
PDF
Moving Beyond Batch: Transactional Databases for Real-time Data
PDF
Employees, Business Partners and Bad Guys: What Web Data Reveals About Person...
PDF
The Transpose Technique On Number Of Transactions Of...
PPTX
Advanced Analytics and Data Science Expertise
PPT
Search Analytics: Diagnosing what ails your site
PDF
Business Analytics and Data mining.pdf
PDF
Data sci sd-11.6.17
PPTX
The Role of Community-Driven Data Curation for Enterprises
PDF
Getstarteddssd12717sd
PDF
Information Systems in Organizations 1st Edition Patricia Wallace Solutions M...
PDF
Understanding big data and data analytics-Business Intelligence
PPTX
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
PPTX
Big data Analytics
PDF
Discovering Big Data in the Fog: Why Catalogs Matter
Agile data science
Big Data Forum - Phoenix
Jisc learning analytics MASHEIN Jan 2017
Business Intelligence and Big Data in Cloud
Data mining
Data Science Business Analytics Sneha Kumari K K Tripathy
Moving Beyond Batch: Transactional Databases for Real-time Data
Employees, Business Partners and Bad Guys: What Web Data Reveals About Person...
The Transpose Technique On Number Of Transactions Of...
Advanced Analytics and Data Science Expertise
Search Analytics: Diagnosing what ails your site
Business Analytics and Data mining.pdf
Data sci sd-11.6.17
The Role of Community-Driven Data Curation for Enterprises
Getstarteddssd12717sd
Information Systems in Organizations 1st Edition Patricia Wallace Solutions M...
Understanding big data and data analytics-Business Intelligence
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big data Analytics
Discovering Big Data in the Fog: Why Catalogs Matter

Recently uploaded (20)

PDF
[EN] Industrial Machine Downtime Prediction
PDF
Tetra Pak Index 2023 - The future of health and nutrition - Full report.pdf
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPT
DU, AIS, Big Data and Data Analytics.ppt
PDF
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
PPTX
DS-40-Pre-Engagement and Kickoff deck - v8.0.pptx
PPTX
Topic 5 Presentation 5 Lesson 5 Corporate Fin
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPTX
IMPACT OF LANDSLIDE.....................
PPTX
Pilar Kemerdekaan dan Identi Bangsa.pptx
PPTX
Leprosy and NLEP programme community medicine
PPTX
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
PDF
annual-report-2024-2025 original latest.
PPTX
A Complete Guide to Streamlining Business Processes
PPTX
Business_Capability_Map_Collection__pptx
PDF
Transcultural that can help you someday.
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
PDF
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
[EN] Industrial Machine Downtime Prediction
Tetra Pak Index 2023 - The future of health and nutrition - Full report.pdf
IBA_Chapter_11_Slides_Final_Accessible.pptx
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
DU, AIS, Big Data and Data Analytics.ppt
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
DS-40-Pre-Engagement and Kickoff deck - v8.0.pptx
Topic 5 Presentation 5 Lesson 5 Corporate Fin
Qualitative Qantitative and Mixed Methods.pptx
IMPACT OF LANDSLIDE.....................
Pilar Kemerdekaan dan Identi Bangsa.pptx
Leprosy and NLEP programme community medicine
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
annual-report-2024-2025 original latest.
A Complete Guide to Streamlining Business Processes
Business_Capability_Map_Collection__pptx
Transcultural that can help you someday.
STERILIZATION AND DISINFECTION-1.ppthhhbx
Optimise Shopper Experiences with a Strong Data Estate.pdf
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...

Data Analytics

  • 2. My Bio Data • Postdoctoral scholar in Social Network Analysis • PhD in Software Engineering (Recommender Systems) • Social Network Analysis from University of Michigan Professional Certificates: • University Lecturer for more than 10 years • Mining Massive Datasets from Stanford University • Pattern Discovery in Data Mining from Illinois University • Process Mining from Eindhoven University of Technology • Statistical Analysis using SPSS & SAS from University of Malaya • MongoDB for DBAs from MongoDB Data Analytics by Vala Ali Rohani
  • 3. Presentation outline Data Science & Big Data Social Network Analysis Process Mining Market Basket Analysis Data Analytics by Vala Ali Rohani
  • 4. Data Analytics by Vala Ali Rohani Data Analytics & Big Data
  • 5. Domain Terminology Data Analytics by Vala Ali Rohani Data Science & Big Data • Data Analysis, Data Mining, Machine Learning and Mathematical Modeling are tools: means towards an end. • Analytics, Business Intelligence, Econometrics and Artificial Intelligence are application areas: domains that use the tools above (and others) to produce results within its subject. • Statistics is a branch of Mathematics providing theoretical and practical support to the above tools. • Data Science is a catch-all term to describe using those all tools to provide answers in those all areas (and also in others), specially when dealing with Big Data http://www.quora.com/What-is-the-difference-between-Data-Analytics-Data-Analysis-Data-Mining-Data-Science-Machine-Learning- and-Big-Data-1
  • 6. Data Analytics by Vala Ali Rohani Data is the New Oil! In the last 10 minutes we generated more data than from prehistoric times until 2003! Data Science & Big Data
  • 7. Data Analytics by Vala Ali Rohani A data scientist is able to collect, analyze, and interpret data from a variety of sources (social interaction, business processes, cyber-physical systems). Turning data into value! Data Science & Big Data
  • 8. Data Analytics by Vala Ali Rohani Four generic data science questions: 1. What happened? 2. Why did it happen? 3. What will happen? 4. What is the best that can happen? Data Science & Big Data
  • 9. Data Analytics by Vala Ali Rohani Data Science & Big Data
  • 10. Data Analytics by Vala Ali Rohani Data Science & Big Data Big data is a broad term for data sets so large or complex that traditional data processing applications are inadequate. http://en.wikipedia.org/wiki/Big_data How Much Data? 1 PB = 1000000000000000B = 1015 bytes = 1000terabytes. • Google processes 20 PB a day (2008) • Facebook has 2.5 PB of user data + 15 TB/day (4/2009) • Each engine of Boeing 747 generates 20 TB of information per hour
  • 11. Data Analytics by Vala Ali Rohani Data Science & Big Data
  • 12. Data Analytics by Vala Ali Rohani Data Science & Big Data Some Big Data Theories and Techniques Map-Reduce Market Basket Analysis Pattern Discovery Social Network Analysis Process Mining
  • 13. Data Analytics by Vala Ali Rohani Social Network Analysis (SNA)
  • 14. Data Analytics by Vala Ali Rohani Social Network Analysis (SNA) Every thing is connected When you sell items … When you receive customer calls ... When you make a contract … When you ship orders …
  • 15. Data Analytics by Vala Ali Rohani Social Network Analysis (SNA) What are Networks? • Networks are sets of nodes connected by edges “Network” ≡ “Graph” node edge
  • 16. Data Analytics by Vala Ali Rohani Social Network Analysis (SNA) What is SNA? SNA (Social Network Analysis) is the mapping and measuring of relationships and flows between people, groups, organizations, computers, URLs, and other connected entities SNA provides both a visual and a mathematical analysis of human relationships.
  • 17. Data Analytics by Vala Ali Rohani Social Network Analysis (SNA) Why do we need Social Network Analysis? • Are nodes connected through the network? • How far apart are they? • Are some nodes more important due to their position in the network? • How will be the patterns for information diffusion? • Is the network composed of communities?
  • 18. Data Analytics by Vala Ali Rohani Social Network Analysis (SNA) Now, let’s see some samples of SNA …
  • 19. Data Analytics by Vala Ali Rohani Social Network Analysis (SNA) Internet structure of the Internet at the level of autonomous systems. Data source: Mark Newman http://www-personal.umich.edu/~mejn/netdata/.
  • 20. Data Analytics by Vala Ali Rohani Social Network Analysis (SNA) Political Blogs 2004 United States Presidential Election Network Liberals Conservatives
  • 21. Data Analytics by Vala Ali Rohani Social Network Analysis (SNA) Facebook Friendship Network
  • 22. Data Analytics by Vala Ali Rohani Social Network Analysis (SNA) SNA in Organizations (or ONA)
  • 23. Data Analytics by Vala Ali Rohani Social Network Analysis (SNA) SNA Metrics : Degree Betweenness Closeness
  • 24. Data Analytics by Vala Ali Rohani Social Network Analysis (SNA) SNA Main Centrality Metrics Degree The number of direct connections that a node has 𝑑𝑖 = 𝑗 𝑎𝑖𝑗 (𝑛 − 1)
  • 25. SNA Main Centrality Metrics Betweenness Betweenness centrality identifies an entity's position within a network in terms of its ability to make connections to other pairs or groups in a network. CB (i) = gjk (i)/gjk j<k å CB ' (i) = CB (i )/[(n -1)(n -2)/2] Data Analytics by Vala Ali Rohani Social Network Analysis (SNA)
  • 26. SNA Main Centrality Metrics Closeness Closeness centrality measures how quickly an entity can access more entities in a network. Cc (i) = d(i, j) j=1 N å é ë ê ê ù û ú ú -1 CC ' (i) = (CC (i))/(N -1) Data Analytics by Vala Ali Rohani Social Network Analysis (SNA)
  • 27. Data Analytics by Vala Ali Rohani Social Network Analysis (SNA) SNA Tools: NodeXL
  • 28. Data Analytics by Vala Ali Rohani Social Network Analysis (SNA) SNA Tools: Gephi
  • 29. Data Analytics by Vala Ali Rohani Social Network Analysis (SNA) SNA Tools: UCINET
  • 30. Key nodes in Organization (from ONA view) Data Analytics by Vala Ali Rohani Social Network Analysis (SNA)
  • 31. Data Analytics by Vala Ali Rohani Social Network Analysis (SNA)
  • 32. Find a node that has high betweenness but low degree Data Analytics by Vala Ali Rohani Social Network Analysis (SNA)
  • 33. Find a node that has low betweenness but high degree Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani Social Network Analysis (SNA)
  • 34. Data Analytics by Vala Ali Rohani Process Mining
  • 35. Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani Process Mining https://www.coursera.org/course/procmin
  • 36. Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani Process Mining https://www.coursera.org/course/procmin Process mining is the missing link between model-based process analysis and data-oriented analysis techniques. Process mining seeks the confrontation between event data (i.e., observed behavior) and process models (hand-made or discovered automatically). Some example applications include: • Analyzing treatment processes in hospitals • Improving customer service processes • Understanding the browsing behavior of customers using a booking site • Analyzing failures of a baggage handling system What is Process Mining?
  • 37. Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani Process Mining https://www.coursera.org/course/procmin • What is the process that people really follow? • Where are the bottlenecks in the studied process? • Where do people (or machines) deviate from the expected or idealized process? • What are the "highways" in my process? • What factors are influencing a bottleneck? • Can we predict problems (delay, deviation, risk, etc.) for running cases? • Can we recommend some improvements for main process of the organization? • How to redesign the process / organization / machine? Process mining use cases
  • 38. Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani Process Mining https://www.coursera.org/course/procmin
  • 39. Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani Process Mining https://www.coursera.org/course/procmin
  • 40. Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani Process Mining https://www.coursera.org/course/procmin
  • 41. Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani Process Mining https://www.coursera.org/course/procmin
  • 42. Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani Process Mining https://www.coursera.org/course/procmin
  • 43. Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani Process Mining Some Examples of Real Discovered Processes
  • 44. Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani Process Mining Some Examples of Real Discovered Processes
  • 45. Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani Process Mining Some Examples of Real Discovered Processes
  • 46. Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani Process Mining Some Examples of Real Discovered Processes
  • 47. Data Analytics by Vala Ali Rohani Market Basket Analysis
  • 48. Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani Market Basket Analysis Introduction Market Basket Analysis (MBA) is a data mining technique which is widely used in the consumer package goods (CPG) industry to identify which items are purchased together and, more importantly, how the purchase of one item affects the likelihood of another item being purchased. Bill Qualls, First Analytics, Raleigh, NC, Introduction to Market Basket Analysis
  • 49. Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani Market Basket Analysis SALES TRANSACTIONS Bill Qualls, First Analytics, Raleigh, NC, Introduction to Market Basket Analysis Our imaginary store sales the following items: bananas, bologna, bread, buns, butter, cereal, cheese, chips, eggs, hotdogs, mayo, milk, mustard, oranges, pickles, and soda. We have recorded 20 sales transactions as follows:
  • 50. Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani Market Basket Analysis MBA Theories: Bill Qualls, First Analytics, Raleigh, NC, Introduction to Market Basket Analysis Support for itemset I = the number of baskets containing all items in I. Given a support threshold s, sets of items that appear in at least s baskets are called frequent itemsets. Association rules are If‐then rules about the contents of baskets. Confidence of this association rule is the probability of j given i1,…,ik.
  • 51. Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani Market Basket Analysis MBA Theories: Bill Qualls, First Analytics, Raleigh, NC, Introduction to Market Basket Analysis
  • 52. Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani Market Basket Analysis Market Basket Example
  • 53. Data Analytics by Vala Ali RohaniData Analytics by Vala Ali Rohani Thank you