Segmenting Sequences of
Node-labeled Graphs
Sorour E. Amiri, Liangzhe Chen, B. Aditya Prakash
Department of Computer Science
Virginia Tech
ICDM, DaMNet, Barcelona, Spain, December 12, 2016
Outline
 Motivation
 Background
 Our Proposed Method: SnapNETS
 Experiments
 Conclusion
Amiri, Chen, Prakash 1
Network Sequences
 Epidemiology: disease spreads over contact networks
 Social Media: Information spreads over friendship networks
2
Flu
Meme
Amiri, Chen, Prakash
Making sense of network sequences
3
Flu
when do the infection patterns change?
Star Bridge Near Clique
Reason:
• Virus mutation
• Vaccination
• …
Amiri, Chen, Prakash
Making sense of network sequences
4
Meme Reason:
• Event
• …
Star Clique
when do the infection patterns change?
Amiri, Chen, Prakash
Problem 1: Network sequence segmentation
 Given a sequence of networks with labeled nodes,
 Find the best segmentation which captures:
 Different distribution of node labels.
5
Star Bridge Near Clique
Amiri, Chen, Prakash
Outline
 Motivation
 Background
 Our Proposed Method: SnapNETS
 Experiments
 Conclusion
6Amiri, Chen, Prakash
Alternative 1: Feature Ext. &Time-series
7
0 0 0 … 2F1: #cliques (of active subgraph)
F2: #ladders (of inactive subgraph)
F3: #ladders (of active subgraph)
1 1 0 … 0
0 0 0 … 1
[Henderson et al. 2010]
[Likas, Vlassis, and Verbeek 2003]
[Li et al. 2009]
Amiri, Chen, Prakash
-1
0
1
2
G1 G2 G3 G4
Features time series
F1 F2 F3
Step 1: Feature Extraction
Step 2: Time-series
segmentation
Alternative 1: Feature Ext. &Time-series
 Drawbacks:
 Laborious feature-engineering
 “Local” change detection:
o One aggregation time period
o Threshold
8Amiri, Chen, Prakash
-1
0
1
2
G1 G2 G3 G4
Features time series
F1 F2 F3
Alternative 2: Plain-graph-based analysis
9
[Shah et al. 2015]
[Sun et al. 2007]
[Lin et al. 2009]
[Qu et al. 2014]
Step 1: Extract active subgraphs
Amiri, Chen, Prakash
Step 2: Dynamic graph segmentation
Alternative 2: Plain-graph-based analysis
 Drawbacks:
Inactive nodes are important to detect different patterns
Amiri, Chen, Prakash 10
Entire graph Active subgraph
Desirable Properties
 P1. Parameter-free:
• No threshold, No fixed granularity
 P2. Comprehensive:
• Use the entire graph
11Amiri, Chen, Prakash
Outline
 Motivation
 Background
 Our Proposed Method: SnapNETS
Overview
 Goal 1: Summarizing Act-snapshots
 Goal 2: Constructing the segmentation graph
 Goal 3: Finding the best segmentation
 Experiments
 Conclusion
12Amiri, Chen, Prakash
Overview of SnapNETS
 Goal 1. Summarize each graph:
Keep structural and label dependent properties
 Goal 2. Construct Segmentation graph:
Define nodes and edges
Defining edges weights
o extract the features of summarized graphs
 Goal 3. Find the best segmentation:
Define the best segmentation (path)
Compute the best segmentation
13Amiri, Chen, Prakash
Technical Challenges
 Using the entire graph snapshots:
 Summarize graph while satisfying P2
 Finding the number of segments:
 Compute segmentation while satisfying P1
14
Reminder:
 P1. Parameter-free
 P2. Comprehensive
Amiri, Chen, Prakash
Outline
 Motivation
 Background
 Our Proposed Method: SnapNETS
Overview
 Goal 1: Summarizing Act-snapshots
 Goal 2: Constructing the segmentation graph
 Goal 3: Finding the best segmentation
 Experiments
 Conclusion
15Amiri, Chen, Prakash
Goal 1: Summarizing graph snapshots
 We want to preserve
 Structural properties
 Nodes labels
 Role of Eigenvalue:
16Amiri, Chen, Prakash
Same leading eigenvalue ( ) of Adjacency matrix Same diffusive
properties
Leading eigenvalue Epidemic threshold [Prakash et al. 2012]
Our Approach
 We want to get a smaller graph with similar eigenvalues:
Successively merge nodes
17Amiri, Chen, Prakash
Problem 2: Graph summarization
 Given: A graph with labeled nodes and a compression ratio.
 Find: a coarsened graph such that:
18Amiri, Chen, Prakash
 CoarseNet algorithm [Purohit et al.2014]
 Matrix perturbation approach
 Successively merge nodes
 Keep leading eigenvalue
 Our tweak
 Do not merge nodes with different labels
Problem 2: Graph
summarization
19
Given: A graph with labeled nodes and a compression ratio.
Find: a coarsened graph such that:
Amiri, Chen, Prakash
Outline
 Motivation
 Background
 Our Proposed Method: SnapNETS
 Overview
 Goal 1: Summarizing Act-snapshots
 Goal 2: Constructing the segmentation graph
 Goal 3: Finding the best segmentation
 Experiments
 Conclusion
20Amiri, Chen, Prakash
 Nodes:
 For each segment there is a node + {Source (‘s’), Target (‘t’)}
 Edges:
 There is a directed edge between adjacent nodes
Goal 2: Segmentation graph
21Amiri, Chen, Prakash
Edge Weights
22
How can we measure the distance between two segments?
Amiri, Chen, Prakash
Our Approach
 Step 1: Extract features from summary graphs:
Easier and more efficient than on original graphs.
No complex features
23Amiri, Chen, Prakash
Step 2: Distance of adjacent segments
24
Edge Weights
Amiri, Chen, Prakash
Outline
 Motivation
 Background
 Our Proposed Method: SnapNETS
 Overview
 Goal 1: Summarizing Act-snapshots
 Goal 2: Constructing the segmentation graph
 Goal 3: Finding the best segmentation
 Experiments
 Conclusion
25Amiri, Chen, Prakash
Goal 3: Finding the best segmentation
 Observation:
For each segmentation there is a path from ‘s’ to ‘t’
For each path from ‘s’ to ‘t’ there is a segmentation
 Therefore,
• Best segmentation problem ≡ Path optimization problem
26Amiri, Chen, Prakash
Possible approach
 Longest path?
27
S t. . .
S t
0.01 0.01 0.01 0.01
0.9 0.9 0.9
Sum = 3
Sum = 2.7
Over segmentation problem
Amiri, Chen, Prakash
Problem 3: Finding the best segmentation
 Our idea: Average longest path
 Advantages:
 Parameter free
 Naturally balances weight of the path with the number of segments.
28
Given a segmentation graph
Find the average longest path from ‘s’ to ‘t’
Amiri, Chen, Prakash
Solving ALP
 Finding the ALP in general graphs is NP-hard.
 The segmentation graph is a DAG ALP can be solved in
polynomial time
 Negative cycle detection [Waggoner et al. 2013]
29Amiri, Chen, Prakash
Complete algorithm
30
Time complexity:
Amiri, Chen, Prakash
Outline
 Motivation
 Background
 Our Proposed Method: SnapNETS
 Overview
 Goal 1: Summarizing Act-snapshots
 Goal 2: Constructing the segmentation graph
 Goal 3: Finding the best segmentation
 Experiments
 Conclusion
31Amiri, Chen, Prakash
Experiments: datasets
 Different Domains with range of sizes:
 BA-degree: Random Barabasi Albert graph
 Higgs: Tweets dataset (with the follower-followee network)
 Memetracker: Who-copies-from-whom blog and website network
 DBLP: Co-authorship network related to ‘network’ topic.
32Amiri, Chen, Prakash
Experiments: baselines
 DYNAMMO [Li et al. 2009]:
 Feature Etraction & time series
 Change point detection ( Reconstruction errors)
 # segments = # segments of SnapNETS .
 VOG [Koutra et al. 2014]:
 Get active sub-graph
 10 most important sub-structures
 Cut when the set of sub-structures changes significantly
o (threshold = the one gives the best result)
 SN-LP:
 Longest Path instead of ALP
33Amiri, Chen, Prakash
Experiments: Quantitative analysis
34
 SnapNETS outperforms the baselines
 Clear patterns in summary graphs
We found Ground truth segmentation
As-Oregon
Amiri, Chen, Prakash
Case studies: Memetracker
35
Televised vice-presidential debates
 Summary graphs are close to
the case when all nodes have
the same label (f5)
 Random nodes are active (f8)
 Summary graphs are
substantially sparser (f2).
 Many active nodes got merged
into important nodes such as
CNN and BBC to form hubs (f6)
Amiri, Chen, Prakash
Case studies: AS-Oregon
36
 New community  New segment
Amiri, Chen, Prakash
Outline
 Motivation
 Background
 Our Proposed Method: SnapNETS
 Overview
 Goal 1: Summarizing Act-snapshots
 Goal 2: Constructing the segmentation graph
 Goal 3: Finding the best segmentation
 Experiments
 Conclusion
37Amiri, Chen, Prakash
Conclusion: SnapNets
 Properties:
 P1. Parameter-free
 P2. Comprehensive
 Patterns:
 the ‘placement’ and ‘connection’ of active/inactive nodes:
• structural (e.g. community/role/centrality)
• rate changes.
 Global method:
 SnapNETS is a ‘global’ method and not simply a change-point detection
method.
38Amiri, Chen, Prakash
Future Work
 Faster ALP:  Linear?
 Handle dynamic graphs with varying
nodes and edges
 More node labels and real value features
 Work with partially observed graphs
39Amiri, Chen, Prakash
Any questions?
40
Funding:
Code at: https://github.com/SorourAmiri/SnapNETS
Sorour E. Amiri Liangzhe Chen B. Aditya Prakash
Goal 1 Goal 2 Goal 3
Finding the best segmentation
Successively merge nodes
Keep leading eigenvalue
Keep same set of labels
Graph summarization Segmentation graph
 Nodes
 Edges
 Edge weights
ALP
SnapNETS Result

More Related Content

PPTX
SnapNETS: Automatic Segmentation of Network Sequences with Node Labels
PDF
Online Multi-Person Tracking Using Variance Magnitude of Image colors and Sol...
PDF
FULL PAPER.PDF
PDF
Topological Data Analysis of Complex Spatial Systems
PDF
A CHINESE CHARACTER RECOGNITION METHOD BASED ON POPULATION MATRIX AND RELATIO...
PDF
Topological Data Analysis
PDF
Deep Convolutional 3D Object Classification from a Single Depth Image and Its...
PDF
Human Head Counting and Detection using Convnets
SnapNETS: Automatic Segmentation of Network Sequences with Node Labels
Online Multi-Person Tracking Using Variance Magnitude of Image colors and Sol...
FULL PAPER.PDF
Topological Data Analysis of Complex Spatial Systems
A CHINESE CHARACTER RECOGNITION METHOD BASED ON POPULATION MATRIX AND RELATIO...
Topological Data Analysis
Deep Convolutional 3D Object Classification from a Single Depth Image and Its...
Human Head Counting and Detection using Convnets

What's hot (15)

PDF
NEW ONTOLOGY RETRIEVAL IMAGE METHOD IN 5K COREL IMAGES
PDF
Learning with Relative Attributes
PDF
LCF: A Temporal Approach to Link Prediction in Dynamic Social Networks
PDF
Ug 205-image-retrieval-using-re-ranking-algorithm-11
PDF
Paper id 36201507
PPTX
Wavelet, Wavelet Image Compression, STW, SPIHT, MATLAB
PDF
Scalable Dynamic Graph Summarization
PDF
LIDAR POINT CLOUD CLASSIFICATION USING EXPECTATION MAXIMIZATION ALGORITHM
PDF
Lidar Point Cloud Classification Using Expectation Maximization Algorithm
PDF
Secret-Fragment-Visible Mosaic Image-Creation and Recovery via Colour Transfo...
PPTX
17 Statistical Models for Networks
PPTX
Community detection algorithms
PDF
Tda presentation
PPTX
Community detection
PDF
Machine Learning Basics
NEW ONTOLOGY RETRIEVAL IMAGE METHOD IN 5K COREL IMAGES
Learning with Relative Attributes
LCF: A Temporal Approach to Link Prediction in Dynamic Social Networks
Ug 205-image-retrieval-using-re-ranking-algorithm-11
Paper id 36201507
Wavelet, Wavelet Image Compression, STW, SPIHT, MATLAB
Scalable Dynamic Graph Summarization
LIDAR POINT CLOUD CLASSIFICATION USING EXPECTATION MAXIMIZATION ALGORITHM
Lidar Point Cloud Classification Using Expectation Maximization Algorithm
Secret-Fragment-Visible Mosaic Image-Creation and Recovery via Colour Transfo...
17 Statistical Models for Networks
Community detection algorithms
Tda presentation
Community detection
Machine Learning Basics
Ad

Viewers also liked (12)

PDF
2010,plant physiology and biochemistry,H2S in stomatal response
PPTX
Fisika gerak parabola
DOCX
vigneshwaran resume updated chennai
PDF
Capixabas reapresentam estudo sobre estoques de café conilon
PPTX
Renacimiento
PPT
Impuesto al valor_agregado_lret_para_revision_estudiantes_12_de_junio (7)
PDF
Criminalidade e pena de morte
PPTX
Presentacion
PDF
ATR - ESPAÇO CONFINADO
PDF
KHO, GEREMY VANDER-RESUME
DOC
Ramadan C.V
PDF
Encerrando um capítulo de mágoa
2010,plant physiology and biochemistry,H2S in stomatal response
Fisika gerak parabola
vigneshwaran resume updated chennai
Capixabas reapresentam estudo sobre estoques de café conilon
Renacimiento
Impuesto al valor_agregado_lret_para_revision_estudiantes_12_de_junio (7)
Criminalidade e pena de morte
Presentacion
ATR - ESPAÇO CONFINADO
KHO, GEREMY VANDER-RESUME
Ramadan C.V
Encerrando um capítulo de mágoa
Ad

Similar to Segmenting Sequences of Node-labeled Graphs (20)

PDF
High-Performance Graph Analysis and Modeling
PPTX
240729_Thuy_Labseminar[Substructure Aware Graph Neural Networks​].pptx
PDF
Time-Evolving Graph Processing On Commodity Clusters
PPTX
240325_JW_labseminar[node2vec: Scalable Feature Learning for Networks].pptx
PDF
Graph Sample and Hold: A Framework for Big Graph Analytics
PDF
Representation Learning in Large Attributed Graphs
PPTX
NS-CUK Joint Journal Club : S.T.Nguyen, Review on "Graph Neural Networks for ...
PDF
Scalable and Efficient Algorithms for Analysis of Massive, Streaming Graphs
PPTX
240722_Thuy_Labseminar[Unveiling Global Interactive Patterns across Graphs: T...
PPTX
Graph Analytics - From the Whiteboard to Your Toolbox - Sam Lerma
PPTX
ICDE-2015 Shortest Path Traversal Optimization and Analysis for Large Graph C...
PDF
Network analysis for computational biology
PPTX
Chapter 4 better.pptx
PDF
Learning Convolutional Neural Networks for Graphs
PPTX
NS - CUK Seminar: V.T.Hoang, Review on "Two-Dimensional Weisfeiler-Lehman Gra...
PDF
Final ppt
PDF
All pair shortest path--SDN
PDF
Maps of sparse memory networks reveal overlapping communities in network flows
PDF
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
PPTX
Data mining.pptx
High-Performance Graph Analysis and Modeling
240729_Thuy_Labseminar[Substructure Aware Graph Neural Networks​].pptx
Time-Evolving Graph Processing On Commodity Clusters
240325_JW_labseminar[node2vec: Scalable Feature Learning for Networks].pptx
Graph Sample and Hold: A Framework for Big Graph Analytics
Representation Learning in Large Attributed Graphs
NS-CUK Joint Journal Club : S.T.Nguyen, Review on "Graph Neural Networks for ...
Scalable and Efficient Algorithms for Analysis of Massive, Streaming Graphs
240722_Thuy_Labseminar[Unveiling Global Interactive Patterns across Graphs: T...
Graph Analytics - From the Whiteboard to Your Toolbox - Sam Lerma
ICDE-2015 Shortest Path Traversal Optimization and Analysis for Large Graph C...
Network analysis for computational biology
Chapter 4 better.pptx
Learning Convolutional Neural Networks for Graphs
NS - CUK Seminar: V.T.Hoang, Review on "Two-Dimensional Weisfeiler-Lehman Gra...
Final ppt
All pair shortest path--SDN
Maps of sparse memory networks reveal overlapping communities in network flows
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
Data mining.pptx

Recently uploaded (20)

PDF
Concepts of Database Management, 10th Edition by Lisa Friedrichsen Test Bank.pdf
PDF
©️ 02_SKU Automatic SW Robotics for Microsoft PC.pdf
PDF
2025-08 San Francisco FinOps Meetup: Tiering, Intelligently.
PPTX
research framework and review of related literature chapter 2
PPTX
langchainpptforbeginners_easy_explanation.pptx
PPTX
ifsm.pptx, institutional food service management
PDF
Mcdonald's : a half century growth . pdf
PDF
The Role of Pathology AI in Translational Cancer Research and Education
PPTX
DATA MODELING, data model concepts, types of data concepts
PPTX
9 Bioterrorism.pptxnsbhsjdgdhdvkdbebrkndbd
PPTX
DATA ANALYTICS COURSE IN PITAMPURA.pptx
PPTX
Introduction to Fundamentals of Data Security
PPTX
transformers as a tool for understanding advance algorithms in deep learning
PPTX
1 hour to get there before the game is done so you don’t need a car seat for ...
PPTX
cp-and-safeguarding-training-2018-2019-mmfv2-230818062456-767bc1a7.pptx
PPTX
PPT for Diseases.pptx, there are 3 types of diseases
PPTX
ch20 Database System Architecture by Rizvee
PPTX
Hushh.ai: Your Personal Data, Your Business
PDF
book-34714 (2).pdfhjkkljgfdssawtjiiiiiujj
PDF
REPORT CARD OF GRADE 2 2025-2026 MATATAG
Concepts of Database Management, 10th Edition by Lisa Friedrichsen Test Bank.pdf
©️ 02_SKU Automatic SW Robotics for Microsoft PC.pdf
2025-08 San Francisco FinOps Meetup: Tiering, Intelligently.
research framework and review of related literature chapter 2
langchainpptforbeginners_easy_explanation.pptx
ifsm.pptx, institutional food service management
Mcdonald's : a half century growth . pdf
The Role of Pathology AI in Translational Cancer Research and Education
DATA MODELING, data model concepts, types of data concepts
9 Bioterrorism.pptxnsbhsjdgdhdvkdbebrkndbd
DATA ANALYTICS COURSE IN PITAMPURA.pptx
Introduction to Fundamentals of Data Security
transformers as a tool for understanding advance algorithms in deep learning
1 hour to get there before the game is done so you don’t need a car seat for ...
cp-and-safeguarding-training-2018-2019-mmfv2-230818062456-767bc1a7.pptx
PPT for Diseases.pptx, there are 3 types of diseases
ch20 Database System Architecture by Rizvee
Hushh.ai: Your Personal Data, Your Business
book-34714 (2).pdfhjkkljgfdssawtjiiiiiujj
REPORT CARD OF GRADE 2 2025-2026 MATATAG

Segmenting Sequences of Node-labeled Graphs

  • 1. Segmenting Sequences of Node-labeled Graphs Sorour E. Amiri, Liangzhe Chen, B. Aditya Prakash Department of Computer Science Virginia Tech ICDM, DaMNet, Barcelona, Spain, December 12, 2016
  • 2. Outline  Motivation  Background  Our Proposed Method: SnapNETS  Experiments  Conclusion Amiri, Chen, Prakash 1
  • 3. Network Sequences  Epidemiology: disease spreads over contact networks  Social Media: Information spreads over friendship networks 2 Flu Meme Amiri, Chen, Prakash
  • 4. Making sense of network sequences 3 Flu when do the infection patterns change? Star Bridge Near Clique Reason: • Virus mutation • Vaccination • … Amiri, Chen, Prakash
  • 5. Making sense of network sequences 4 Meme Reason: • Event • … Star Clique when do the infection patterns change? Amiri, Chen, Prakash
  • 6. Problem 1: Network sequence segmentation  Given a sequence of networks with labeled nodes,  Find the best segmentation which captures:  Different distribution of node labels. 5 Star Bridge Near Clique Amiri, Chen, Prakash
  • 7. Outline  Motivation  Background  Our Proposed Method: SnapNETS  Experiments  Conclusion 6Amiri, Chen, Prakash
  • 8. Alternative 1: Feature Ext. &Time-series 7 0 0 0 … 2F1: #cliques (of active subgraph) F2: #ladders (of inactive subgraph) F3: #ladders (of active subgraph) 1 1 0 … 0 0 0 0 … 1 [Henderson et al. 2010] [Likas, Vlassis, and Verbeek 2003] [Li et al. 2009] Amiri, Chen, Prakash -1 0 1 2 G1 G2 G3 G4 Features time series F1 F2 F3 Step 1: Feature Extraction Step 2: Time-series segmentation
  • 9. Alternative 1: Feature Ext. &Time-series  Drawbacks:  Laborious feature-engineering  “Local” change detection: o One aggregation time period o Threshold 8Amiri, Chen, Prakash -1 0 1 2 G1 G2 G3 G4 Features time series F1 F2 F3
  • 10. Alternative 2: Plain-graph-based analysis 9 [Shah et al. 2015] [Sun et al. 2007] [Lin et al. 2009] [Qu et al. 2014] Step 1: Extract active subgraphs Amiri, Chen, Prakash Step 2: Dynamic graph segmentation
  • 11. Alternative 2: Plain-graph-based analysis  Drawbacks: Inactive nodes are important to detect different patterns Amiri, Chen, Prakash 10 Entire graph Active subgraph
  • 12. Desirable Properties  P1. Parameter-free: • No threshold, No fixed granularity  P2. Comprehensive: • Use the entire graph 11Amiri, Chen, Prakash
  • 13. Outline  Motivation  Background  Our Proposed Method: SnapNETS Overview  Goal 1: Summarizing Act-snapshots  Goal 2: Constructing the segmentation graph  Goal 3: Finding the best segmentation  Experiments  Conclusion 12Amiri, Chen, Prakash
  • 14. Overview of SnapNETS  Goal 1. Summarize each graph: Keep structural and label dependent properties  Goal 2. Construct Segmentation graph: Define nodes and edges Defining edges weights o extract the features of summarized graphs  Goal 3. Find the best segmentation: Define the best segmentation (path) Compute the best segmentation 13Amiri, Chen, Prakash
  • 15. Technical Challenges  Using the entire graph snapshots:  Summarize graph while satisfying P2  Finding the number of segments:  Compute segmentation while satisfying P1 14 Reminder:  P1. Parameter-free  P2. Comprehensive Amiri, Chen, Prakash
  • 16. Outline  Motivation  Background  Our Proposed Method: SnapNETS Overview  Goal 1: Summarizing Act-snapshots  Goal 2: Constructing the segmentation graph  Goal 3: Finding the best segmentation  Experiments  Conclusion 15Amiri, Chen, Prakash
  • 17. Goal 1: Summarizing graph snapshots  We want to preserve  Structural properties  Nodes labels  Role of Eigenvalue: 16Amiri, Chen, Prakash Same leading eigenvalue ( ) of Adjacency matrix Same diffusive properties Leading eigenvalue Epidemic threshold [Prakash et al. 2012]
  • 18. Our Approach  We want to get a smaller graph with similar eigenvalues: Successively merge nodes 17Amiri, Chen, Prakash
  • 19. Problem 2: Graph summarization  Given: A graph with labeled nodes and a compression ratio.  Find: a coarsened graph such that: 18Amiri, Chen, Prakash
  • 20.  CoarseNet algorithm [Purohit et al.2014]  Matrix perturbation approach  Successively merge nodes  Keep leading eigenvalue  Our tweak  Do not merge nodes with different labels Problem 2: Graph summarization 19 Given: A graph with labeled nodes and a compression ratio. Find: a coarsened graph such that: Amiri, Chen, Prakash
  • 21. Outline  Motivation  Background  Our Proposed Method: SnapNETS  Overview  Goal 1: Summarizing Act-snapshots  Goal 2: Constructing the segmentation graph  Goal 3: Finding the best segmentation  Experiments  Conclusion 20Amiri, Chen, Prakash
  • 22.  Nodes:  For each segment there is a node + {Source (‘s’), Target (‘t’)}  Edges:  There is a directed edge between adjacent nodes Goal 2: Segmentation graph 21Amiri, Chen, Prakash
  • 23. Edge Weights 22 How can we measure the distance between two segments? Amiri, Chen, Prakash
  • 24. Our Approach  Step 1: Extract features from summary graphs: Easier and more efficient than on original graphs. No complex features 23Amiri, Chen, Prakash
  • 25. Step 2: Distance of adjacent segments 24 Edge Weights Amiri, Chen, Prakash
  • 26. Outline  Motivation  Background  Our Proposed Method: SnapNETS  Overview  Goal 1: Summarizing Act-snapshots  Goal 2: Constructing the segmentation graph  Goal 3: Finding the best segmentation  Experiments  Conclusion 25Amiri, Chen, Prakash
  • 27. Goal 3: Finding the best segmentation  Observation: For each segmentation there is a path from ‘s’ to ‘t’ For each path from ‘s’ to ‘t’ there is a segmentation  Therefore, • Best segmentation problem ≡ Path optimization problem 26Amiri, Chen, Prakash
  • 28. Possible approach  Longest path? 27 S t. . . S t 0.01 0.01 0.01 0.01 0.9 0.9 0.9 Sum = 3 Sum = 2.7 Over segmentation problem Amiri, Chen, Prakash
  • 29. Problem 3: Finding the best segmentation  Our idea: Average longest path  Advantages:  Parameter free  Naturally balances weight of the path with the number of segments. 28 Given a segmentation graph Find the average longest path from ‘s’ to ‘t’ Amiri, Chen, Prakash
  • 30. Solving ALP  Finding the ALP in general graphs is NP-hard.  The segmentation graph is a DAG ALP can be solved in polynomial time  Negative cycle detection [Waggoner et al. 2013] 29Amiri, Chen, Prakash
  • 32. Outline  Motivation  Background  Our Proposed Method: SnapNETS  Overview  Goal 1: Summarizing Act-snapshots  Goal 2: Constructing the segmentation graph  Goal 3: Finding the best segmentation  Experiments  Conclusion 31Amiri, Chen, Prakash
  • 33. Experiments: datasets  Different Domains with range of sizes:  BA-degree: Random Barabasi Albert graph  Higgs: Tweets dataset (with the follower-followee network)  Memetracker: Who-copies-from-whom blog and website network  DBLP: Co-authorship network related to ‘network’ topic. 32Amiri, Chen, Prakash
  • 34. Experiments: baselines  DYNAMMO [Li et al. 2009]:  Feature Etraction & time series  Change point detection ( Reconstruction errors)  # segments = # segments of SnapNETS .  VOG [Koutra et al. 2014]:  Get active sub-graph  10 most important sub-structures  Cut when the set of sub-structures changes significantly o (threshold = the one gives the best result)  SN-LP:  Longest Path instead of ALP 33Amiri, Chen, Prakash
  • 35. Experiments: Quantitative analysis 34  SnapNETS outperforms the baselines  Clear patterns in summary graphs We found Ground truth segmentation As-Oregon Amiri, Chen, Prakash
  • 36. Case studies: Memetracker 35 Televised vice-presidential debates  Summary graphs are close to the case when all nodes have the same label (f5)  Random nodes are active (f8)  Summary graphs are substantially sparser (f2).  Many active nodes got merged into important nodes such as CNN and BBC to form hubs (f6) Amiri, Chen, Prakash
  • 37. Case studies: AS-Oregon 36  New community  New segment Amiri, Chen, Prakash
  • 38. Outline  Motivation  Background  Our Proposed Method: SnapNETS  Overview  Goal 1: Summarizing Act-snapshots  Goal 2: Constructing the segmentation graph  Goal 3: Finding the best segmentation  Experiments  Conclusion 37Amiri, Chen, Prakash
  • 39. Conclusion: SnapNets  Properties:  P1. Parameter-free  P2. Comprehensive  Patterns:  the ‘placement’ and ‘connection’ of active/inactive nodes: • structural (e.g. community/role/centrality) • rate changes.  Global method:  SnapNETS is a ‘global’ method and not simply a change-point detection method. 38Amiri, Chen, Prakash
  • 40. Future Work  Faster ALP:  Linear?  Handle dynamic graphs with varying nodes and edges  More node labels and real value features  Work with partially observed graphs 39Amiri, Chen, Prakash
  • 41. Any questions? 40 Funding: Code at: https://github.com/SorourAmiri/SnapNETS Sorour E. Amiri Liangzhe Chen B. Aditya Prakash Goal 1 Goal 2 Goal 3 Finding the best segmentation Successively merge nodes Keep leading eigenvalue Keep same set of labels Graph summarization Segmentation graph  Nodes  Edges  Edge weights ALP SnapNETS Result

Editor's Notes

  • #23: Smaller size summarization maintains the relevant important properties effectively
  • #24: Smaller size summarization maintains the relevant important properties effectively
  • #25: Smaller size summarization maintains the relevant important properties effectively
  • #26: Smaller size summarization maintains the relevant important properties effectively
  • #34: BA-degree: We activate highest degree and then lowest degree nodes on a
  • #35: BA-degree: We activate highest degree and then lowest degree nodes on a