Segmenting Sequences of
Node-labeled Graphs
Sorour E. Amiri, Liangzhe Chen, B. Aditya Prakash
Department of Computer Science
Virginia Tech
ICDM, DaMNet, Barcelona, Spain, December 12, 2016
Outline
 Motivation
 Background
 Our Proposed Method: SnapNETS
 Experiments
 Conclusion
Amiri, Chen, Prakash 1
Network Sequences
 Epidemiology: disease spreads over contact networks
 Social Media: Information spreads over friendship networks
2
Flu
Meme
Amiri, Chen, Prakash
Making sense of network sequences
3
Flu
when do the infection patterns change?
Star Bridge Near Clique
Reason:
• Virus mutation
• Vaccination
• …
Amiri, Chen, Prakash
Making sense of network sequences
4
Meme Reason:
• Event
• …
Star Clique
when do the infection patterns change?
Amiri, Chen, Prakash
Problem 1: Network sequence segmentation
 Given a sequence of networks with labeled nodes,
 Find the best segmentation which captures:
 Different distribution of node labels.
5
Star Bridge Near Clique
Amiri, Chen, Prakash
Outline
 Motivation
 Background
 Our Proposed Method: SnapNETS
 Experiments
 Conclusion
6Amiri, Chen, Prakash
Alternative 1: Feature Ext. &Time-series
7
0 0 0 … 2F1: #cliques (of active subgraph)
F2: #ladders (of inactive subgraph)
F3: #ladders (of active subgraph)
1 1 0 … 0
0 0 0 … 1
[Henderson et al. 2010]
[Likas, Vlassis, and Verbeek 2003]
[Li et al. 2009]
Amiri, Chen, Prakash
-1
0
1
2
G1 G2 G3 G4
Features time series
F1 F2 F3
Step 1: Feature Extraction
Step 2: Time-series
segmentation
Alternative 1: Feature Ext. &Time-series
 Drawbacks:
 Laborious feature-engineering
 “Local” change detection:
o One aggregation time period
o Threshold
8Amiri, Chen, Prakash
-1
0
1
2
G1 G2 G3 G4
Features time series
F1 F2 F3
Alternative 2: Plain-graph-based analysis
9
[Shah et al. 2015]
[Sun et al. 2007]
[Lin et al. 2009]
[Qu et al. 2014]
Step 1: Extract active subgraphs
Amiri, Chen, Prakash
Step 2: Dynamic graph segmentation
Alternative 2: Plain-graph-based analysis
 Drawbacks:
Inactive nodes are important to detect different patterns
Amiri, Chen, Prakash 10
Entire graph Active subgraph
Desirable Properties
 P1. Parameter-free:
• No threshold, No fixed granularity
 P2. Comprehensive:
• Use the entire graph
11Amiri, Chen, Prakash
Outline
 Motivation
 Background
 Our Proposed Method: SnapNETS
Overview
 Goal 1: Summarizing Act-snapshots
 Goal 2: Constructing the segmentation graph
 Goal 3: Finding the best segmentation
 Experiments
 Conclusion
12Amiri, Chen, Prakash
Overview of SnapNETS
 Goal 1. Summarize each graph:
Keep structural and label dependent properties
 Goal 2. Construct Segmentation graph:
Define nodes and edges
Defining edges weights
o extract the features of summarized graphs
 Goal 3. Find the best segmentation:
Define the best segmentation (path)
Compute the best segmentation
13Amiri, Chen, Prakash
Technical Challenges
 Using the entire graph snapshots:
 Summarize graph while satisfying P2
 Finding the number of segments:
 Compute segmentation while satisfying P1
14
Reminder:
 P1. Parameter-free
 P2. Comprehensive
Amiri, Chen, Prakash
Outline
 Motivation
 Background
 Our Proposed Method: SnapNETS
Overview
 Goal 1: Summarizing Act-snapshots
 Goal 2: Constructing the segmentation graph
 Goal 3: Finding the best segmentation
 Experiments
 Conclusion
15Amiri, Chen, Prakash
Goal 1: Summarizing graph snapshots
 We want to preserve
 Structural properties
 Nodes labels
 Role of Eigenvalue:
16Amiri, Chen, Prakash
Same leading eigenvalue ( ) of Adjacency matrix Same diffusive
properties
Leading eigenvalue Epidemic threshold [Prakash et al. 2012]
Our Approach
 We want to get a smaller graph with similar eigenvalues:
Successively merge nodes
17Amiri, Chen, Prakash
Problem 2: Graph summarization
 Given: A graph with labeled nodes and a compression ratio.
 Find: a coarsened graph such that:
18Amiri, Chen, Prakash
 CoarseNet algorithm [Purohit et al.2014]
 Matrix perturbation approach
 Successively merge nodes
 Keep leading eigenvalue
 Our tweak
 Do not merge nodes with different labels
Problem 2: Graph
summarization
19
Given: A graph with labeled nodes and a compression ratio.
Find: a coarsened graph such that:
Amiri, Chen, Prakash
Outline
 Motivation
 Background
 Our Proposed Method: SnapNETS
 Overview
 Goal 1: Summarizing Act-snapshots
 Goal 2: Constructing the segmentation graph
 Goal 3: Finding the best segmentation
 Experiments
 Conclusion
20Amiri, Chen, Prakash
 Nodes:
 For each segment there is a node + {Source (‘s’), Target (‘t’)}
 Edges:
 There is a directed edge between adjacent nodes
Goal 2: Segmentation graph
21Amiri, Chen, Prakash
Edge Weights
22
How can we measure the distance between two segments?
Amiri, Chen, Prakash
Our Approach
 Step 1: Extract features from summary graphs:
Easier and more efficient than on original graphs.
No complex features
23Amiri, Chen, Prakash
Step 2: Distance of adjacent segments
24
Edge Weights
Amiri, Chen, Prakash
Outline
 Motivation
 Background
 Our Proposed Method: SnapNETS
 Overview
 Goal 1: Summarizing Act-snapshots
 Goal 2: Constructing the segmentation graph
 Goal 3: Finding the best segmentation
 Experiments
 Conclusion
25Amiri, Chen, Prakash
Goal 3: Finding the best segmentation
 Observation:
For each segmentation there is a path from ‘s’ to ‘t’
For each path from ‘s’ to ‘t’ there is a segmentation
 Therefore,
• Best segmentation problem ≡ Path optimization problem
26Amiri, Chen, Prakash
Possible approach
 Longest path?
27
S t. . .
S t
0.01 0.01 0.01 0.01
0.9 0.9 0.9
Sum = 3
Sum = 2.7
Over segmentation problem
Amiri, Chen, Prakash
Problem 3: Finding the best segmentation
 Our idea: Average longest path
 Advantages:
 Parameter free
 Naturally balances weight of the path with the number of segments.
28
Given a segmentation graph
Find the average longest path from ‘s’ to ‘t’
Amiri, Chen, Prakash
Solving ALP
 Finding the ALP in general graphs is NP-hard.
 The segmentation graph is a DAG ALP can be solved in
polynomial time
 Negative cycle detection [Waggoner et al. 2013]
29Amiri, Chen, Prakash
Complete algorithm
30
Time complexity:
Amiri, Chen, Prakash
Outline
 Motivation
 Background
 Our Proposed Method: SnapNETS
 Overview
 Goal 1: Summarizing Act-snapshots
 Goal 2: Constructing the segmentation graph
 Goal 3: Finding the best segmentation
 Experiments
 Conclusion
31Amiri, Chen, Prakash
Experiments: datasets
 Different Domains with range of sizes:
 BA-degree: Random Barabasi Albert graph
 Higgs: Tweets dataset (with the follower-followee network)
 Memetracker: Who-copies-from-whom blog and website network
 DBLP: Co-authorship network related to ‘network’ topic.
32Amiri, Chen, Prakash
Experiments: baselines
 DYNAMMO [Li et al. 2009]:
 Feature Etraction & time series
 Change point detection ( Reconstruction errors)
 # segments = # segments of SnapNETS .
 VOG [Koutra et al. 2014]:
 Get active sub-graph
 10 most important sub-structures
 Cut when the set of sub-structures changes significantly
o (threshold = the one gives the best result)
 SN-LP:
 Longest Path instead of ALP
33Amiri, Chen, Prakash
Experiments: Quantitative analysis
34
 SnapNETS outperforms the baselines
 Clear patterns in summary graphs
We found Ground truth segmentation
As-Oregon
Amiri, Chen, Prakash
Case studies: Memetracker
35
Televised vice-presidential debates
 Summary graphs are close to
the case when all nodes have
the same label (f5)
 Random nodes are active (f8)
 Summary graphs are
substantially sparser (f2).
 Many active nodes got merged
into important nodes such as
CNN and BBC to form hubs (f6)
Amiri, Chen, Prakash
Case studies: AS-Oregon
36
 New community  New segment
Amiri, Chen, Prakash
Outline
 Motivation
 Background
 Our Proposed Method: SnapNETS
 Overview
 Goal 1: Summarizing Act-snapshots
 Goal 2: Constructing the segmentation graph
 Goal 3: Finding the best segmentation
 Experiments
 Conclusion
37Amiri, Chen, Prakash
Conclusion: SnapNets
 Properties:
 P1. Parameter-free
 P2. Comprehensive
 Patterns:
 the ‘placement’ and ‘connection’ of active/inactive nodes:
• structural (e.g. community/role/centrality)
• rate changes.
 Global method:
 SnapNETS is a ‘global’ method and not simply a change-point detection
method.
38Amiri, Chen, Prakash
Future Work
 Faster ALP:  Linear?
 Handle dynamic graphs with varying
nodes and edges
 More node labels and real value features
 Work with partially observed graphs
39Amiri, Chen, Prakash
Any questions?
40
Funding:
Code at: https://github.com/SorourAmiri/SnapNETS
Sorour E. Amiri Liangzhe Chen B. Aditya Prakash
Goal 1 Goal 2 Goal 3
Finding the best segmentation
Successively merge nodes
Keep leading eigenvalue
Keep same set of labels
Graph summarization Segmentation graph
 Nodes
 Edges
 Edge weights
ALP
SnapNETS Result

More Related Content

PPTX
SnapNETS: Automatic Segmentation of Network Sequences with Node Labels
PDF
Online Multi-Person Tracking Using Variance Magnitude of Image colors and Sol...
PDF
FULL PAPER.PDF
PDF
Topological Data Analysis of Complex Spatial Systems
PDF
A CHINESE CHARACTER RECOGNITION METHOD BASED ON POPULATION MATRIX AND RELATIO...
PDF
Topological Data Analysis
PDF
Deep Convolutional 3D Object Classification from a Single Depth Image and Its...
PDF
Human Head Counting and Detection using Convnets
SnapNETS: Automatic Segmentation of Network Sequences with Node Labels
Online Multi-Person Tracking Using Variance Magnitude of Image colors and Sol...
FULL PAPER.PDF
Topological Data Analysis of Complex Spatial Systems
A CHINESE CHARACTER RECOGNITION METHOD BASED ON POPULATION MATRIX AND RELATIO...
Topological Data Analysis
Deep Convolutional 3D Object Classification from a Single Depth Image and Its...
Human Head Counting and Detection using Convnets

What's hot (15)

PDF
NEW ONTOLOGY RETRIEVAL IMAGE METHOD IN 5K COREL IMAGES
PDF
Learning with Relative Attributes
PDF
LCF: A Temporal Approach to Link Prediction in Dynamic Social Networks
PDF
Ug 205-image-retrieval-using-re-ranking-algorithm-11
PDF
Paper id 36201507
PPTX
Wavelet, Wavelet Image Compression, STW, SPIHT, MATLAB
PDF
Scalable Dynamic Graph Summarization
PDF
LIDAR POINT CLOUD CLASSIFICATION USING EXPECTATION MAXIMIZATION ALGORITHM
PDF
Lidar Point Cloud Classification Using Expectation Maximization Algorithm
PDF
Secret-Fragment-Visible Mosaic Image-Creation and Recovery via Colour Transfo...
PPTX
17 Statistical Models for Networks
PPTX
Community detection algorithms
PDF
Tda presentation
PPTX
Community detection
PDF
Machine Learning Basics
NEW ONTOLOGY RETRIEVAL IMAGE METHOD IN 5K COREL IMAGES
Learning with Relative Attributes
LCF: A Temporal Approach to Link Prediction in Dynamic Social Networks
Ug 205-image-retrieval-using-re-ranking-algorithm-11
Paper id 36201507
Wavelet, Wavelet Image Compression, STW, SPIHT, MATLAB
Scalable Dynamic Graph Summarization
LIDAR POINT CLOUD CLASSIFICATION USING EXPECTATION MAXIMIZATION ALGORITHM
Lidar Point Cloud Classification Using Expectation Maximization Algorithm
Secret-Fragment-Visible Mosaic Image-Creation and Recovery via Colour Transfo...
17 Statistical Models for Networks
Community detection algorithms
Tda presentation
Community detection
Machine Learning Basics
Ad

Viewers also liked (12)

PDF
2010,plant physiology and biochemistry,H2S in stomatal response
PPTX
Fisika gerak parabola
DOCX
vigneshwaran resume updated chennai
PDF
Capixabas reapresentam estudo sobre estoques de café conilon
PPTX
Renacimiento
PPT
Impuesto al valor_agregado_lret_para_revision_estudiantes_12_de_junio (7)
PDF
Criminalidade e pena de morte
PPTX
Presentacion
PDF
ATR - ESPAÇO CONFINADO
PDF
KHO, GEREMY VANDER-RESUME
DOC
Ramadan C.V
PDF
Encerrando um capítulo de mágoa
2010,plant physiology and biochemistry,H2S in stomatal response
Fisika gerak parabola
vigneshwaran resume updated chennai
Capixabas reapresentam estudo sobre estoques de café conilon
Renacimiento
Impuesto al valor_agregado_lret_para_revision_estudiantes_12_de_junio (7)
Criminalidade e pena de morte
Presentacion
ATR - ESPAÇO CONFINADO
KHO, GEREMY VANDER-RESUME
Ramadan C.V
Encerrando um capítulo de mágoa
Ad

Similar to Segmenting Sequences of Node-labeled Graphs (20)

PDF
Machine_Learining_Concepts_DecisionTrees&PCA.pdf
PPTX
PERT. ppts for operation researchs assgn.pptx
PDF
Module - 5 Machine Learning-22ISE62.pdf
PPTX
ML basic & clustering
PPTX
Predict saturated thickness using tensor board visualization
PPTX
ICDE-2015 Shortest Path Traversal Optimization and Analysis for Large Graph C...
PPTX
Introduction to Datamining Concept and Techniques
PDF
Mat189: Cluster Analysis with NBA Sports Data
PPTX
PPTX
LRP for hand gesture recogntion.pptx
PDF
A New Method Based on MDA to Enhance the Face Recognition Performance
PPTX
Avi-newmans_fast_community_detection.pptx
PDF
Introduction to data mining and machine learning
DOCX
K means report
PDF
Current clustering techniques
PDF
Using Networks to Measure Influence and Impact
PDF
Overlapping community detection in Large-Scale Networks using BigCLAM model b...
PDF
1. Introduction to Algorithms, Specification of Algorithm, Complexity.pdf
PDF
Segmentation by Fusion of Self-Adaptive SFCM Cluster in Multi-Color Space Com...
PPTX
Developing Computational Skills in the Sciences with Matlab Webinar 2017
Machine_Learining_Concepts_DecisionTrees&PCA.pdf
PERT. ppts for operation researchs assgn.pptx
Module - 5 Machine Learning-22ISE62.pdf
ML basic & clustering
Predict saturated thickness using tensor board visualization
ICDE-2015 Shortest Path Traversal Optimization and Analysis for Large Graph C...
Introduction to Datamining Concept and Techniques
Mat189: Cluster Analysis with NBA Sports Data
LRP for hand gesture recogntion.pptx
A New Method Based on MDA to Enhance the Face Recognition Performance
Avi-newmans_fast_community_detection.pptx
Introduction to data mining and machine learning
K means report
Current clustering techniques
Using Networks to Measure Influence and Impact
Overlapping community detection in Large-Scale Networks using BigCLAM model b...
1. Introduction to Algorithms, Specification of Algorithm, Complexity.pdf
Segmentation by Fusion of Self-Adaptive SFCM Cluster in Multi-Color Space Com...
Developing Computational Skills in the Sciences with Matlab Webinar 2017

Recently uploaded (20)

PPTX
Tapan_20220802057_Researchinternship_final_stage.pptx
PDF
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
PPTX
CYBER SECURITY the Next Warefare Tactics
PPTX
IMPACT OF LANDSLIDE.....................
PDF
Tetra Pak Index 2023 - The future of health and nutrition - Full report.pdf
PPTX
Machine Learning and working of machine Learning
PDF
Navigating the Thai Supplements Landscape.pdf
PPTX
chrmotography.pptx food anaylysis techni
PPTX
eGramSWARAJ-PPT Training Module for beginners
PPTX
chuitkarjhanbijunsdivndsijvndiucbhsaxnmzsicvjsd
PDF
©️ 02_SKU Automatic SW Robotics for Microsoft PC.pdf
PDF
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
PPT
DU, AIS, Big Data and Data Analytics.ppt
PDF
Microsoft 365 products and services descrption
PPTX
ai agent creaction with langgraph_presentation_
PPT
expt-design-lecture-12 hghhgfggjhjd (1).ppt
PPT
lectureusjsjdhdsjjshdshshddhdhddhhd1.ppt
PDF
CS3352FOUNDATION OF DATA SCIENCE _1_MAterial.pdf
PPTX
SET 1 Compulsory MNH machine learning intro
PPTX
Lesson-01intheselfoflifeofthekennyrogersoftheunderstandoftheunderstanded
Tapan_20220802057_Researchinternship_final_stage.pptx
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
CYBER SECURITY the Next Warefare Tactics
IMPACT OF LANDSLIDE.....................
Tetra Pak Index 2023 - The future of health and nutrition - Full report.pdf
Machine Learning and working of machine Learning
Navigating the Thai Supplements Landscape.pdf
chrmotography.pptx food anaylysis techni
eGramSWARAJ-PPT Training Module for beginners
chuitkarjhanbijunsdivndsijvndiucbhsaxnmzsicvjsd
©️ 02_SKU Automatic SW Robotics for Microsoft PC.pdf
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
DU, AIS, Big Data and Data Analytics.ppt
Microsoft 365 products and services descrption
ai agent creaction with langgraph_presentation_
expt-design-lecture-12 hghhgfggjhjd (1).ppt
lectureusjsjdhdsjjshdshshddhdhddhhd1.ppt
CS3352FOUNDATION OF DATA SCIENCE _1_MAterial.pdf
SET 1 Compulsory MNH machine learning intro
Lesson-01intheselfoflifeofthekennyrogersoftheunderstandoftheunderstanded

Segmenting Sequences of Node-labeled Graphs

  • 1. Segmenting Sequences of Node-labeled Graphs Sorour E. Amiri, Liangzhe Chen, B. Aditya Prakash Department of Computer Science Virginia Tech ICDM, DaMNet, Barcelona, Spain, December 12, 2016
  • 2. Outline  Motivation  Background  Our Proposed Method: SnapNETS  Experiments  Conclusion Amiri, Chen, Prakash 1
  • 3. Network Sequences  Epidemiology: disease spreads over contact networks  Social Media: Information spreads over friendship networks 2 Flu Meme Amiri, Chen, Prakash
  • 4. Making sense of network sequences 3 Flu when do the infection patterns change? Star Bridge Near Clique Reason: • Virus mutation • Vaccination • … Amiri, Chen, Prakash
  • 5. Making sense of network sequences 4 Meme Reason: • Event • … Star Clique when do the infection patterns change? Amiri, Chen, Prakash
  • 6. Problem 1: Network sequence segmentation  Given a sequence of networks with labeled nodes,  Find the best segmentation which captures:  Different distribution of node labels. 5 Star Bridge Near Clique Amiri, Chen, Prakash
  • 7. Outline  Motivation  Background  Our Proposed Method: SnapNETS  Experiments  Conclusion 6Amiri, Chen, Prakash
  • 8. Alternative 1: Feature Ext. &Time-series 7 0 0 0 … 2F1: #cliques (of active subgraph) F2: #ladders (of inactive subgraph) F3: #ladders (of active subgraph) 1 1 0 … 0 0 0 0 … 1 [Henderson et al. 2010] [Likas, Vlassis, and Verbeek 2003] [Li et al. 2009] Amiri, Chen, Prakash -1 0 1 2 G1 G2 G3 G4 Features time series F1 F2 F3 Step 1: Feature Extraction Step 2: Time-series segmentation
  • 9. Alternative 1: Feature Ext. &Time-series  Drawbacks:  Laborious feature-engineering  “Local” change detection: o One aggregation time period o Threshold 8Amiri, Chen, Prakash -1 0 1 2 G1 G2 G3 G4 Features time series F1 F2 F3
  • 10. Alternative 2: Plain-graph-based analysis 9 [Shah et al. 2015] [Sun et al. 2007] [Lin et al. 2009] [Qu et al. 2014] Step 1: Extract active subgraphs Amiri, Chen, Prakash Step 2: Dynamic graph segmentation
  • 11. Alternative 2: Plain-graph-based analysis  Drawbacks: Inactive nodes are important to detect different patterns Amiri, Chen, Prakash 10 Entire graph Active subgraph
  • 12. Desirable Properties  P1. Parameter-free: • No threshold, No fixed granularity  P2. Comprehensive: • Use the entire graph 11Amiri, Chen, Prakash
  • 13. Outline  Motivation  Background  Our Proposed Method: SnapNETS Overview  Goal 1: Summarizing Act-snapshots  Goal 2: Constructing the segmentation graph  Goal 3: Finding the best segmentation  Experiments  Conclusion 12Amiri, Chen, Prakash
  • 14. Overview of SnapNETS  Goal 1. Summarize each graph: Keep structural and label dependent properties  Goal 2. Construct Segmentation graph: Define nodes and edges Defining edges weights o extract the features of summarized graphs  Goal 3. Find the best segmentation: Define the best segmentation (path) Compute the best segmentation 13Amiri, Chen, Prakash
  • 15. Technical Challenges  Using the entire graph snapshots:  Summarize graph while satisfying P2  Finding the number of segments:  Compute segmentation while satisfying P1 14 Reminder:  P1. Parameter-free  P2. Comprehensive Amiri, Chen, Prakash
  • 16. Outline  Motivation  Background  Our Proposed Method: SnapNETS Overview  Goal 1: Summarizing Act-snapshots  Goal 2: Constructing the segmentation graph  Goal 3: Finding the best segmentation  Experiments  Conclusion 15Amiri, Chen, Prakash
  • 17. Goal 1: Summarizing graph snapshots  We want to preserve  Structural properties  Nodes labels  Role of Eigenvalue: 16Amiri, Chen, Prakash Same leading eigenvalue ( ) of Adjacency matrix Same diffusive properties Leading eigenvalue Epidemic threshold [Prakash et al. 2012]
  • 18. Our Approach  We want to get a smaller graph with similar eigenvalues: Successively merge nodes 17Amiri, Chen, Prakash
  • 19. Problem 2: Graph summarization  Given: A graph with labeled nodes and a compression ratio.  Find: a coarsened graph such that: 18Amiri, Chen, Prakash
  • 20.  CoarseNet algorithm [Purohit et al.2014]  Matrix perturbation approach  Successively merge nodes  Keep leading eigenvalue  Our tweak  Do not merge nodes with different labels Problem 2: Graph summarization 19 Given: A graph with labeled nodes and a compression ratio. Find: a coarsened graph such that: Amiri, Chen, Prakash
  • 21. Outline  Motivation  Background  Our Proposed Method: SnapNETS  Overview  Goal 1: Summarizing Act-snapshots  Goal 2: Constructing the segmentation graph  Goal 3: Finding the best segmentation  Experiments  Conclusion 20Amiri, Chen, Prakash
  • 22.  Nodes:  For each segment there is a node + {Source (‘s’), Target (‘t’)}  Edges:  There is a directed edge between adjacent nodes Goal 2: Segmentation graph 21Amiri, Chen, Prakash
  • 23. Edge Weights 22 How can we measure the distance between two segments? Amiri, Chen, Prakash
  • 24. Our Approach  Step 1: Extract features from summary graphs: Easier and more efficient than on original graphs. No complex features 23Amiri, Chen, Prakash
  • 25. Step 2: Distance of adjacent segments 24 Edge Weights Amiri, Chen, Prakash
  • 26. Outline  Motivation  Background  Our Proposed Method: SnapNETS  Overview  Goal 1: Summarizing Act-snapshots  Goal 2: Constructing the segmentation graph  Goal 3: Finding the best segmentation  Experiments  Conclusion 25Amiri, Chen, Prakash
  • 27. Goal 3: Finding the best segmentation  Observation: For each segmentation there is a path from ‘s’ to ‘t’ For each path from ‘s’ to ‘t’ there is a segmentation  Therefore, • Best segmentation problem ≡ Path optimization problem 26Amiri, Chen, Prakash
  • 28. Possible approach  Longest path? 27 S t. . . S t 0.01 0.01 0.01 0.01 0.9 0.9 0.9 Sum = 3 Sum = 2.7 Over segmentation problem Amiri, Chen, Prakash
  • 29. Problem 3: Finding the best segmentation  Our idea: Average longest path  Advantages:  Parameter free  Naturally balances weight of the path with the number of segments. 28 Given a segmentation graph Find the average longest path from ‘s’ to ‘t’ Amiri, Chen, Prakash
  • 30. Solving ALP  Finding the ALP in general graphs is NP-hard.  The segmentation graph is a DAG ALP can be solved in polynomial time  Negative cycle detection [Waggoner et al. 2013] 29Amiri, Chen, Prakash
  • 32. Outline  Motivation  Background  Our Proposed Method: SnapNETS  Overview  Goal 1: Summarizing Act-snapshots  Goal 2: Constructing the segmentation graph  Goal 3: Finding the best segmentation  Experiments  Conclusion 31Amiri, Chen, Prakash
  • 33. Experiments: datasets  Different Domains with range of sizes:  BA-degree: Random Barabasi Albert graph  Higgs: Tweets dataset (with the follower-followee network)  Memetracker: Who-copies-from-whom blog and website network  DBLP: Co-authorship network related to ‘network’ topic. 32Amiri, Chen, Prakash
  • 34. Experiments: baselines  DYNAMMO [Li et al. 2009]:  Feature Etraction & time series  Change point detection ( Reconstruction errors)  # segments = # segments of SnapNETS .  VOG [Koutra et al. 2014]:  Get active sub-graph  10 most important sub-structures  Cut when the set of sub-structures changes significantly o (threshold = the one gives the best result)  SN-LP:  Longest Path instead of ALP 33Amiri, Chen, Prakash
  • 35. Experiments: Quantitative analysis 34  SnapNETS outperforms the baselines  Clear patterns in summary graphs We found Ground truth segmentation As-Oregon Amiri, Chen, Prakash
  • 36. Case studies: Memetracker 35 Televised vice-presidential debates  Summary graphs are close to the case when all nodes have the same label (f5)  Random nodes are active (f8)  Summary graphs are substantially sparser (f2).  Many active nodes got merged into important nodes such as CNN and BBC to form hubs (f6) Amiri, Chen, Prakash
  • 37. Case studies: AS-Oregon 36  New community  New segment Amiri, Chen, Prakash
  • 38. Outline  Motivation  Background  Our Proposed Method: SnapNETS  Overview  Goal 1: Summarizing Act-snapshots  Goal 2: Constructing the segmentation graph  Goal 3: Finding the best segmentation  Experiments  Conclusion 37Amiri, Chen, Prakash
  • 39. Conclusion: SnapNets  Properties:  P1. Parameter-free  P2. Comprehensive  Patterns:  the ‘placement’ and ‘connection’ of active/inactive nodes: • structural (e.g. community/role/centrality) • rate changes.  Global method:  SnapNETS is a ‘global’ method and not simply a change-point detection method. 38Amiri, Chen, Prakash
  • 40. Future Work  Faster ALP:  Linear?  Handle dynamic graphs with varying nodes and edges  More node labels and real value features  Work with partially observed graphs 39Amiri, Chen, Prakash
  • 41. Any questions? 40 Funding: Code at: https://github.com/SorourAmiri/SnapNETS Sorour E. Amiri Liangzhe Chen B. Aditya Prakash Goal 1 Goal 2 Goal 3 Finding the best segmentation Successively merge nodes Keep leading eigenvalue Keep same set of labels Graph summarization Segmentation graph  Nodes  Edges  Edge weights ALP SnapNETS Result

Editor's Notes

  • #23: Smaller size summarization maintains the relevant important properties effectively
  • #24: Smaller size summarization maintains the relevant important properties effectively
  • #25: Smaller size summarization maintains the relevant important properties effectively
  • #26: Smaller size summarization maintains the relevant important properties effectively
  • #34: BA-degree: We activate highest degree and then lowest degree nodes on a
  • #35: BA-degree: We activate highest degree and then lowest degree nodes on a