SlideShare a Scribd company logo
Amy E. Hodler
Graph Analytics & AI Program Manager, Neo4j
Amy.Hodler@neo4j.com @amyhodler
How Graphs Enhance AI
Graphs in Data Science
Advancing through the Steps of
Graph Data Science
How Graphs Enhance AI
How Graphs Enhance AI
Predicting Financial Contagion
from Global to Local
Financial Crimes Drug Discovery Recommendations
Cybersecurity Predictive Maintenance
Customer Segmentation
Churn Prediction Search/MDM
Graphs Data Science Applications
“The idea is that graph networks are bigger than
any one machine-learning approach.
Graphs bring an ability to generalize about
structure that the individual neural nets don't have.”
"Where do the graphs
come from that
graph networks
operate over?”
Getting Started
7
Building a Graph ML Model
Data
Sources
Native Graph
Platform
Machine
Learning
Aggregate Disparate
Data and Cleanse
Build Predictive
Models
Unify Graphs and
Engineer Features
Parquet JSON
and more…
MLlib
and more…
Spark Graph Native Graph
Platform
Machine Learning
Example: Spark & Neo4j Workflow
Graph
Transactions
Graph
Analytics
Cypher 9 in Spark 3.0
to create non-
persistent graphs
MLlib to Train Models
Native Graph Algorithms,
Processing, and Storage
Morpheus
integration
Explore Graphs Build Graph Solutions
• Massively scalable
• Powerful data pipelining
• Robust ML Libraries
• Non-persistent, non-native graphs
• Persistent, dynamic graphs
• Graph native query and algorithm
performance
• Constantly growing list of graph
algorithms and embeddings
Steps Forward
11
Steps Forward in Graph Data Science
Graph Persistence
Knowledge
Graphs
Connected Feature
Engineering
Graph Native
Learning
Steps Forward in Graph Data Science
Query Based
Knowledge
Graph
Query Based
Feature
Engineering
Graph
Algorithm
Feature
Engineering
Graph
Embeddings
Graph Neural
Networks
Enterprise Delivery
DataScienceComplexity
Knowledge
Graphs
Graph Feature
Engineering
Graph Native
Learning
Graph Persistence
Steps Forward in Graph Data Science
Query Based
Knowledge
Graph
Query Based
Feature
Engineering
Graph Algorithm
Feature
Engineering
Graph
Embeddings
Graph Neural
Networks
Enterprise Maturity
DataScienceComplexity
Query-Based Knowledge Graphs
Connecting the Dots
• Multiple graph layers of financial
information
• Includes corporate data with
cross-relationships, external
news, and customized weighting
• Dashboards and tools
• Credit risk
• Investment risk
• Portfolio news recommendations
Steps Forward in Graph Data Science
Query Based
Knowledge
Graph
Graph
Algorithm
Feature
Engineering
Graph
Embeddings
Graph Neural
Networks
Query Based
Feature
Engineering
Enterprise Maturity
DataScienceComplexity
HetioNet is a knowledge
graph integrating over 50
years of biomedical data
Leveraged to predict new
uses for drugs by using the
graph topology to create
features to predict new links
Query-Based Feature Engineering
Mining Data for Drug Discovery
het.io
Query-Based Feature Engineering
Mining Data for Drug Discovery
HetioNet is a knowledge
graph integrating over 50
years of biomedical data
Leveraged to predict new
uses for drugs by using the
graph topology to create
features to predict new links
het.io
Query-Based Feature Engineering
Mining Data for Drug Discovery
Spark Graph Native Graph
Platform
Machine Learning
• Merge distributed data
into DataFrames
• Reshape your tables
into graphs
• Explore cypher queries
• Move to Neo4j to build
expert queries
• Persist your graph
Knowledge Graphs:
Getting Started Example with Spark
• Bring query based
graph features to ML
pipeline
Graph
Transactions
Graph
Analytics
Steps Forward in Graph Data Science
Query Based
Feature
Engineering
Graph
Embeddings
Graph Neural
Networks
Query Based
Knowledge
Graph
Graph
Algorithm
Feature
Engineering
Enterprise Maturity
DataScienceComplexity
Feature Engineering is how we combine and process the
data to create new, more meaningful features, such as
clustering or connectivity metrics.
Graph Connected Feature Engineering
Add More Descriptive Features:
- Influence
- Relationships
- Communities
Extraction
23
Graph Feature Categories & Algorithms
Pathfinding
& Search
Finds the optimal paths or evaluates
route availability and quality
Centrality /
Importance
Determines the importance of
distinct nodes in the network
Community
Detection
Detects group clustering or
partition options
Heuristic
Link Prediction
Estimates the likelihood of nodes
forming a relationship
Evaluates how alike
nodes are
Similarity Embeddings
Learned representations
of connectivity or topology
• Connected components to identify
disjointed graphs sharing identifiers
• PageRank to measure influence and
transaction volumes
• Louvain to identify communities
that frequently interact
• Jaccard to measure account
similarity
24
Graph Connected Feature Engineering
Financial Crime: Detecting Fraud
Large financial institutions already have existing pipelines to identify
fraud via heuristics and models
Graph based features improve accuracy:
+48,000 U.S. Patents for
Graph Fraud / Anomaly Detection
in the last 10 years
Spark Graph Native Graph
Platform
Machine Learning
• Merge distributed data
into DataFrames
• Reshape your tables
into graphs
• Explore cypher queries
and simple algorithms
• Persist your graph
• Create rule based
features
• Run native graph
algorithms and write to
graph or stream
Graph Feature Engineering:
Getting Started Example with Spark
• Bring graph features
to ML pipeline for
training
Graph
Transactions
Graph
Analytics
27
Graph Algorithms in Neo4J
• Parallel Breadth First Search
• Parallel Depth First Search
• Shortest Path
• Single-Source Shortest Path
• All Pairs Shortest Path
• Minimum Spanning Tree
• A* Shortest Path
• Yen’s K Shortest Path
• K-Spanning Tree (MST)
• Random Walk
• Degree Centrality
• Closeness Centrality
• CC Variations: Harmonic, Dangalchev,
Wasserman & Faust
• Betweenness Centrality
• Approximate Betweenness Centrality
• PageRank
• Personalized PageRank
• ArticleRank
• Eigenvector Centrality
• Triangle Count
• Clustering Coefficients
• Connected Components (Union Find)
• Strongly Connected Components
• Label Propagation
• Louvain Modularity – 1 Step & Multi-Step
• Balanced Triad (identification)
• Euclidean Distance
• Cosine Similarity
• Jaccard Similarity
• Overlap Similarity
• Pearson Similarity
Pathfinding
& Search
Centrality /
Importance
Community
Detection
Similarity
neo4j.com/docs/
graph-algorithms/current/
Link
Prediction
• Adamic Adar
• Common Neighbors
• Preferential Attachment
• Resource Allocations
• Same Community
• Total Neighbors
Steps Forward in Graph Data Science
Query Based
Knowledge
Graph
Graph
Algorithm
Feature
Engineering
Graph Neural
Networks
Query Based
Feature
Engineering
Graph
Embeddings
Enterprise Maturity
DataScienceComplexity
Embedding transforms graphs into a feature vector, or
set of vectors, describing topology, connectivity, or
attributes of nodes and edges in the graph
29
Graph Embeddings
• Vertex/Node embeddings: describe connectivity of each node
• Path embeddings: traversals across the graph
• Graph embeddings: encode an entire graph into a single vector
Explainable Reasoning over Knowledge Graphs for
Recommendation
30
Graph Embeddings - Recommendations
31
Graph Embeddings - Recommendations
Explainable Reasoning over Knowledge Graphs for
Recommendation
Spark Graph Native Graph
Platform
Machine Learning
• Merge distributed data
into DataFrames
• Reshape your tables
into graphs
• Explore cypher queries
and simple algorithms
• Move to Neo4j to build
expert queries
• Write to persist
• Stay tuned for
DeepWalk and DeepGL
algorithms
Graph Feature Engineering-Embedding:
Getting Started Example with Spark
• Bring graph features
to ML pipeline for
training
Graph
Transactions
Graph
Analytics
Steps Forward in Graph Data Science
Query Based
Knowledge
Graph
Graph
Algorithm
Feature
Engineering
Query Based
Feature
Engineering
Graph Neural
Networks
Graph
Embeddings
Enterprise Maturity
DataScienceComplexity
Deep Learning refers to training multi-layer neural
networks using gradient descent
34
Graph Native Learning
Graph Native Learning refers to deep learning models
that take a graph as an input, performs computations,
and return a graph
35
Graph Native Learning
Battaglia et al, 2018
Example: electron path prediction
Bradshaw et al, 2019
36
Graph Native Learning
Given reactants and reagents, what will the
products be?
Given reactants and reagents, what will the
products be?
Steps Forward in Graph Data Science
Query Based
Knowledge
Graph
Query Based
Feature
Engineering
Graph
Algorithm
Feature
Engineering
Graph
Embeddings
Graph Neural
Networks
Enterprise Delivery
DataScienceComplexity
Knowledge
Graphs
Graph Feature
Engineering
Graph Native
Learning
Graph Persistence
Resources
Business
• neo4j.com/use-cases/
artificial-intelligence-analytics/
• AI Whitepaper
Data Scientists/Developers
• neo4j.com/sandbox
• neo4j.com/developer/
• community.neo4j.com
Amy.Hodler@neo4j.com
@amyhodler
neo4j.com/
graph-algorithms-book

More Related Content

What's hot (20)

PPTX
The art of the possible with graph technology_Neo4j GraphSummit Dublin 2023.pptx
Neo4j
 
PDF
How the Neanex digital twin solution delivers on both speed and scale to the ...
Neo4j
 
PDF
Graph database Use Cases
Max De Marzi
 
PDF
Building Modern Streaming Analytics with Confluent on AWS
confluent
 
PDF
LinkedInSaxoBankDataWorkbench
Sheetal Pratik
 
PDF
Slides: Knowledge Graphs vs. Property Graphs
DATAVERSITY
 
PPTX
Neo4j graph database
Prashant Bhargava
 
PDF
Cassandra at Instagram 2016 (Dikang Gu, Facebook) | Cassandra Summit 2016
DataStax
 
PDF
Workshop - Build a Graph Solution
Neo4j
 
PDF
Nadine Schöne, Dataiku. The Complete Data Value Chain in a Nutshell
IT Arena
 
PDF
The Path To Success With Graph Database and Analytics
Neo4j
 
PDF
Definitive Guide to Select Right Data Warehouse (2020)
Sprinkle Data Inc
 
PPTX
Azure Synapse Analytics Overview (r2)
James Serra
 
PDF
Jay Kreps on Project Voldemort Scaling Simple Storage At LinkedIn
LinkedIn
 
PDF
Modernizing to a Cloud Data Architecture
Databricks
 
PDF
Your Roadmap for An Enterprise Graph Strategy
Neo4j
 
PPTX
Turning Data into Business Value with a Modern Data Platform
Cloudera, Inc.
 
PDF
Getting Started with Delta Lake on Databricks
Knoldus Inc.
 
PDF
Straight Talk to Demystify Data Lineage
DATAVERSITY
 
PPTX
Semantics and Machine Learning
Vladimir Alexiev, PhD, PMP
 
The art of the possible with graph technology_Neo4j GraphSummit Dublin 2023.pptx
Neo4j
 
How the Neanex digital twin solution delivers on both speed and scale to the ...
Neo4j
 
Graph database Use Cases
Max De Marzi
 
Building Modern Streaming Analytics with Confluent on AWS
confluent
 
LinkedInSaxoBankDataWorkbench
Sheetal Pratik
 
Slides: Knowledge Graphs vs. Property Graphs
DATAVERSITY
 
Neo4j graph database
Prashant Bhargava
 
Cassandra at Instagram 2016 (Dikang Gu, Facebook) | Cassandra Summit 2016
DataStax
 
Workshop - Build a Graph Solution
Neo4j
 
Nadine Schöne, Dataiku. The Complete Data Value Chain in a Nutshell
IT Arena
 
The Path To Success With Graph Database and Analytics
Neo4j
 
Definitive Guide to Select Right Data Warehouse (2020)
Sprinkle Data Inc
 
Azure Synapse Analytics Overview (r2)
James Serra
 
Jay Kreps on Project Voldemort Scaling Simple Storage At LinkedIn
LinkedIn
 
Modernizing to a Cloud Data Architecture
Databricks
 
Your Roadmap for An Enterprise Graph Strategy
Neo4j
 
Turning Data into Business Value with a Modern Data Platform
Cloudera, Inc.
 
Getting Started with Delta Lake on Databricks
Knoldus Inc.
 
Straight Talk to Demystify Data Lineage
DATAVERSITY
 
Semantics and Machine Learning
Vladimir Alexiev, PhD, PMP
 

Similar to How Graphs Enhance AI (20)

PDF
Leveraging Graphs for Better AI
Neo4j
 
PDF
Leveraging Graphs for Better AI
Neo4j
 
PDF
How Graph Technology is Changing AI
Databricks
 
PDF
Transforming AI with Graphs: Real World Examples using Spark and Neo4j
Databricks
 
PDF
Transforming AI with Graphs: Real World Examples using Spark and Neo4j
Fred Madrid
 
PDF
GraphTour 2020 - Graphs & AI: A Path for Data Science
Neo4j
 
PDF
GraphTour London 2020 - Graphs for AI, Amy Hodler
Neo4j
 
PDF
What Is GDS and Neo4j’s GDS Library
Neo4j
 
PDF
GPT and Graph Data Science to power your Knowledge Graph
Neo4j
 
PDF
GraphSummit Toronto: Leveraging Graphs for AI and ML
Neo4j
 
PPTX
Using Connected Data and Graph Technology to Enhance Machine Learning and Art...
Neo4j
 
PDF
Graph Data Science with Neo4j: Nordics Webinar
Neo4j
 
PDF
Leveraging Graphs for Artificial Intelligence and Machine Learning - Phani Da...
Neo4j
 
PDF
ntroducing to the Power of Graph Technology
Neo4j
 
PPTX
How Graph Data Science can turbocharge your Knowledge Graph
Neo4j
 
PDF
Graphs for Data Science and Machine Learning
Neo4j
 
PDF
3. Relationships Matter: Using Connected Data for Better Machine Learning
Neo4j
 
PDF
Leveraging Graphs for Artificial Intelligence and Machine Learning - Phani Da...
Neo4j
 
PDF
Workshop Tel Aviv - Graph Data Science
Neo4j
 
PDF
AI, ML and Graph Algorithms: Real Life Use Cases with Neo4j
Ivan Zoratti
 
Leveraging Graphs for Better AI
Neo4j
 
Leveraging Graphs for Better AI
Neo4j
 
How Graph Technology is Changing AI
Databricks
 
Transforming AI with Graphs: Real World Examples using Spark and Neo4j
Databricks
 
Transforming AI with Graphs: Real World Examples using Spark and Neo4j
Fred Madrid
 
GraphTour 2020 - Graphs & AI: A Path for Data Science
Neo4j
 
GraphTour London 2020 - Graphs for AI, Amy Hodler
Neo4j
 
What Is GDS and Neo4j’s GDS Library
Neo4j
 
GPT and Graph Data Science to power your Knowledge Graph
Neo4j
 
GraphSummit Toronto: Leveraging Graphs for AI and ML
Neo4j
 
Using Connected Data and Graph Technology to Enhance Machine Learning and Art...
Neo4j
 
Graph Data Science with Neo4j: Nordics Webinar
Neo4j
 
Leveraging Graphs for Artificial Intelligence and Machine Learning - Phani Da...
Neo4j
 
ntroducing to the Power of Graph Technology
Neo4j
 
How Graph Data Science can turbocharge your Knowledge Graph
Neo4j
 
Graphs for Data Science and Machine Learning
Neo4j
 
3. Relationships Matter: Using Connected Data for Better Machine Learning
Neo4j
 
Leveraging Graphs for Artificial Intelligence and Machine Learning - Phani Da...
Neo4j
 
Workshop Tel Aviv - Graph Data Science
Neo4j
 
AI, ML and Graph Algorithms: Real Life Use Cases with Neo4j
Ivan Zoratti
 
Ad

More from Neo4j (20)

PDF
GraphSummit Singapore Master Deck - May 20, 2025
Neo4j
 
PPTX
Graphs & GraphRAG - Essential Ingredients for GenAI
Neo4j
 
PPTX
Neo4j Knowledge for Customer Experience.pptx
Neo4j
 
PPTX
GraphTalk New Zealand - The Art of The Possible.pptx
Neo4j
 
PDF
Neo4j: The Art of the Possible with Graph
Neo4j
 
PDF
Smarter Knowledge Graphs For Public Sector
Neo4j
 
PDF
GraphRAG and Knowledge Graphs Exploring AI's Future
Neo4j
 
PDF
Matinée GenAI & GraphRAG Paris - Décembre 24
Neo4j
 
PDF
ANZ Presentation: GraphSummit Melbourne 2024
Neo4j
 
PDF
Google Cloud Presentation GraphSummit Melbourne 2024: Building Generative AI ...
Neo4j
 
PDF
Telstra Presentation GraphSummit Melbourne: Optimising Business Outcomes with...
Neo4j
 
PDF
Hands-On GraphRAG Workshop: GraphSummit Melbourne 2024
Neo4j
 
PDF
Démonstration Digital Twin Building Wire Management
Neo4j
 
PDF
Swiss Life - Les graphes au service de la détection de fraude dans le domaine...
Neo4j
 
PDF
Démonstration Supply Chain - GraphTalk Paris
Neo4j
 
PDF
The Art of Possible - GraphTalk Paris Opening Session
Neo4j
 
PPTX
How Siemens bolstered supply chain resilience with graph-powered AI insights ...
Neo4j
 
PDF
Knowledge Graphs for AI-Ready Data and Enterprise Deployment - Gartner IT Sym...
Neo4j
 
PDF
Neo4j Graph Data Modelling Session - GraphTalk
Neo4j
 
PDF
Neo4j: The Art of Possible with Graph Technology
Neo4j
 
GraphSummit Singapore Master Deck - May 20, 2025
Neo4j
 
Graphs & GraphRAG - Essential Ingredients for GenAI
Neo4j
 
Neo4j Knowledge for Customer Experience.pptx
Neo4j
 
GraphTalk New Zealand - The Art of The Possible.pptx
Neo4j
 
Neo4j: The Art of the Possible with Graph
Neo4j
 
Smarter Knowledge Graphs For Public Sector
Neo4j
 
GraphRAG and Knowledge Graphs Exploring AI's Future
Neo4j
 
Matinée GenAI & GraphRAG Paris - Décembre 24
Neo4j
 
ANZ Presentation: GraphSummit Melbourne 2024
Neo4j
 
Google Cloud Presentation GraphSummit Melbourne 2024: Building Generative AI ...
Neo4j
 
Telstra Presentation GraphSummit Melbourne: Optimising Business Outcomes with...
Neo4j
 
Hands-On GraphRAG Workshop: GraphSummit Melbourne 2024
Neo4j
 
Démonstration Digital Twin Building Wire Management
Neo4j
 
Swiss Life - Les graphes au service de la détection de fraude dans le domaine...
Neo4j
 
Démonstration Supply Chain - GraphTalk Paris
Neo4j
 
The Art of Possible - GraphTalk Paris Opening Session
Neo4j
 
How Siemens bolstered supply chain resilience with graph-powered AI insights ...
Neo4j
 
Knowledge Graphs for AI-Ready Data and Enterprise Deployment - Gartner IT Sym...
Neo4j
 
Neo4j Graph Data Modelling Session - GraphTalk
Neo4j
 
Neo4j: The Art of Possible with Graph Technology
Neo4j
 
Ad

Recently uploaded (20)

PDF
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
PDF
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
PPTX
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
PPTX
AUTOMATION AND ROBOTICS IN PHARMA INDUSTRY.pptx
sameeraaabegumm
 
PDF
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
PDF
Presentation - Vibe Coding The Future of Tech
yanuarsinggih1
 
PDF
The Builder’s Playbook - 2025 State of AI Report.pdf
jeroen339954
 
PPTX
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
PDF
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
PDF
Exolore The Essential AI Tools in 2025.pdf
Srinivasan M
 
PDF
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
PDF
HCIP-Data Center Facility Deployment V2.0 Training Material (Without Remarks ...
mcastillo49
 
PDF
Complete JavaScript Notes: From Basics to Advanced Concepts.pdf
haydendavispro
 
PPTX
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
PDF
SWEBOK Guide and Software Services Engineering Education
Hironori Washizaki
 
PDF
July Patch Tuesday
Ivanti
 
PDF
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
PDF
Log-Based Anomaly Detection: Enhancing System Reliability with Machine Learning
Mohammed BEKKOUCHE
 
PDF
CIFDAQ Market Insights for July 7th 2025
CIFDAQ
 
PDF
Fl Studio 24.2.2 Build 4597 Crack for Windows Free Download 2025
faizk77g
 
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
AUTOMATION AND ROBOTICS IN PHARMA INDUSTRY.pptx
sameeraaabegumm
 
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
Presentation - Vibe Coding The Future of Tech
yanuarsinggih1
 
The Builder’s Playbook - 2025 State of AI Report.pdf
jeroen339954
 
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
Exolore The Essential AI Tools in 2025.pdf
Srinivasan M
 
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
HCIP-Data Center Facility Deployment V2.0 Training Material (Without Remarks ...
mcastillo49
 
Complete JavaScript Notes: From Basics to Advanced Concepts.pdf
haydendavispro
 
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
SWEBOK Guide and Software Services Engineering Education
Hironori Washizaki
 
July Patch Tuesday
Ivanti
 
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
Log-Based Anomaly Detection: Enhancing System Reliability with Machine Learning
Mohammed BEKKOUCHE
 
CIFDAQ Market Insights for July 7th 2025
CIFDAQ
 
Fl Studio 24.2.2 Build 4597 Crack for Windows Free Download 2025
faizk77g
 

How Graphs Enhance AI

  • 1. Amy E. Hodler Graph Analytics & AI Program Manager, Neo4j [email protected] @amyhodler How Graphs Enhance AI Graphs in Data Science Advancing through the Steps of Graph Data Science
  • 5. Financial Crimes Drug Discovery Recommendations Cybersecurity Predictive Maintenance Customer Segmentation Churn Prediction Search/MDM Graphs Data Science Applications
  • 6. “The idea is that graph networks are bigger than any one machine-learning approach. Graphs bring an ability to generalize about structure that the individual neural nets don't have.” "Where do the graphs come from that graph networks operate over?”
  • 8. Building a Graph ML Model Data Sources Native Graph Platform Machine Learning Aggregate Disparate Data and Cleanse Build Predictive Models Unify Graphs and Engineer Features Parquet JSON and more… MLlib and more…
  • 9. Spark Graph Native Graph Platform Machine Learning Example: Spark & Neo4j Workflow Graph Transactions Graph Analytics Cypher 9 in Spark 3.0 to create non- persistent graphs MLlib to Train Models Native Graph Algorithms, Processing, and Storage Morpheus integration
  • 10. Explore Graphs Build Graph Solutions • Massively scalable • Powerful data pipelining • Robust ML Libraries • Non-persistent, non-native graphs • Persistent, dynamic graphs • Graph native query and algorithm performance • Constantly growing list of graph algorithms and embeddings
  • 12. Steps Forward in Graph Data Science Graph Persistence Knowledge Graphs Connected Feature Engineering Graph Native Learning
  • 13. Steps Forward in Graph Data Science Query Based Knowledge Graph Query Based Feature Engineering Graph Algorithm Feature Engineering Graph Embeddings Graph Neural Networks Enterprise Delivery DataScienceComplexity Knowledge Graphs Graph Feature Engineering Graph Native Learning Graph Persistence
  • 14. Steps Forward in Graph Data Science Query Based Knowledge Graph Query Based Feature Engineering Graph Algorithm Feature Engineering Graph Embeddings Graph Neural Networks Enterprise Maturity DataScienceComplexity
  • 15. Query-Based Knowledge Graphs Connecting the Dots • Multiple graph layers of financial information • Includes corporate data with cross-relationships, external news, and customized weighting • Dashboards and tools • Credit risk • Investment risk • Portfolio news recommendations
  • 16. Steps Forward in Graph Data Science Query Based Knowledge Graph Graph Algorithm Feature Engineering Graph Embeddings Graph Neural Networks Query Based Feature Engineering Enterprise Maturity DataScienceComplexity
  • 17. HetioNet is a knowledge graph integrating over 50 years of biomedical data Leveraged to predict new uses for drugs by using the graph topology to create features to predict new links Query-Based Feature Engineering Mining Data for Drug Discovery het.io
  • 18. Query-Based Feature Engineering Mining Data for Drug Discovery HetioNet is a knowledge graph integrating over 50 years of biomedical data Leveraged to predict new uses for drugs by using the graph topology to create features to predict new links het.io
  • 19. Query-Based Feature Engineering Mining Data for Drug Discovery
  • 20. Spark Graph Native Graph Platform Machine Learning • Merge distributed data into DataFrames • Reshape your tables into graphs • Explore cypher queries • Move to Neo4j to build expert queries • Persist your graph Knowledge Graphs: Getting Started Example with Spark • Bring query based graph features to ML pipeline Graph Transactions Graph Analytics
  • 21. Steps Forward in Graph Data Science Query Based Feature Engineering Graph Embeddings Graph Neural Networks Query Based Knowledge Graph Graph Algorithm Feature Engineering Enterprise Maturity DataScienceComplexity
  • 22. Feature Engineering is how we combine and process the data to create new, more meaningful features, such as clustering or connectivity metrics. Graph Connected Feature Engineering Add More Descriptive Features: - Influence - Relationships - Communities Extraction
  • 23. 23 Graph Feature Categories & Algorithms Pathfinding & Search Finds the optimal paths or evaluates route availability and quality Centrality / Importance Determines the importance of distinct nodes in the network Community Detection Detects group clustering or partition options Heuristic Link Prediction Estimates the likelihood of nodes forming a relationship Evaluates how alike nodes are Similarity Embeddings Learned representations of connectivity or topology
  • 24. • Connected components to identify disjointed graphs sharing identifiers • PageRank to measure influence and transaction volumes • Louvain to identify communities that frequently interact • Jaccard to measure account similarity 24 Graph Connected Feature Engineering Financial Crime: Detecting Fraud Large financial institutions already have existing pipelines to identify fraud via heuristics and models Graph based features improve accuracy:
  • 25. +48,000 U.S. Patents for Graph Fraud / Anomaly Detection in the last 10 years
  • 26. Spark Graph Native Graph Platform Machine Learning • Merge distributed data into DataFrames • Reshape your tables into graphs • Explore cypher queries and simple algorithms • Persist your graph • Create rule based features • Run native graph algorithms and write to graph or stream Graph Feature Engineering: Getting Started Example with Spark • Bring graph features to ML pipeline for training Graph Transactions Graph Analytics
  • 27. 27 Graph Algorithms in Neo4J • Parallel Breadth First Search • Parallel Depth First Search • Shortest Path • Single-Source Shortest Path • All Pairs Shortest Path • Minimum Spanning Tree • A* Shortest Path • Yen’s K Shortest Path • K-Spanning Tree (MST) • Random Walk • Degree Centrality • Closeness Centrality • CC Variations: Harmonic, Dangalchev, Wasserman & Faust • Betweenness Centrality • Approximate Betweenness Centrality • PageRank • Personalized PageRank • ArticleRank • Eigenvector Centrality • Triangle Count • Clustering Coefficients • Connected Components (Union Find) • Strongly Connected Components • Label Propagation • Louvain Modularity – 1 Step & Multi-Step • Balanced Triad (identification) • Euclidean Distance • Cosine Similarity • Jaccard Similarity • Overlap Similarity • Pearson Similarity Pathfinding & Search Centrality / Importance Community Detection Similarity neo4j.com/docs/ graph-algorithms/current/ Link Prediction • Adamic Adar • Common Neighbors • Preferential Attachment • Resource Allocations • Same Community • Total Neighbors
  • 28. Steps Forward in Graph Data Science Query Based Knowledge Graph Graph Algorithm Feature Engineering Graph Neural Networks Query Based Feature Engineering Graph Embeddings Enterprise Maturity DataScienceComplexity
  • 29. Embedding transforms graphs into a feature vector, or set of vectors, describing topology, connectivity, or attributes of nodes and edges in the graph 29 Graph Embeddings • Vertex/Node embeddings: describe connectivity of each node • Path embeddings: traversals across the graph • Graph embeddings: encode an entire graph into a single vector
  • 30. Explainable Reasoning over Knowledge Graphs for Recommendation 30 Graph Embeddings - Recommendations
  • 31. 31 Graph Embeddings - Recommendations Explainable Reasoning over Knowledge Graphs for Recommendation
  • 32. Spark Graph Native Graph Platform Machine Learning • Merge distributed data into DataFrames • Reshape your tables into graphs • Explore cypher queries and simple algorithms • Move to Neo4j to build expert queries • Write to persist • Stay tuned for DeepWalk and DeepGL algorithms Graph Feature Engineering-Embedding: Getting Started Example with Spark • Bring graph features to ML pipeline for training Graph Transactions Graph Analytics
  • 33. Steps Forward in Graph Data Science Query Based Knowledge Graph Graph Algorithm Feature Engineering Query Based Feature Engineering Graph Neural Networks Graph Embeddings Enterprise Maturity DataScienceComplexity
  • 34. Deep Learning refers to training multi-layer neural networks using gradient descent 34 Graph Native Learning
  • 35. Graph Native Learning refers to deep learning models that take a graph as an input, performs computations, and return a graph 35 Graph Native Learning Battaglia et al, 2018
  • 36. Example: electron path prediction Bradshaw et al, 2019 36 Graph Native Learning Given reactants and reagents, what will the products be? Given reactants and reagents, what will the products be?
  • 37. Steps Forward in Graph Data Science Query Based Knowledge Graph Query Based Feature Engineering Graph Algorithm Feature Engineering Graph Embeddings Graph Neural Networks Enterprise Delivery DataScienceComplexity Knowledge Graphs Graph Feature Engineering Graph Native Learning Graph Persistence
  • 38. Resources Business • neo4j.com/use-cases/ artificial-intelligence-analytics/ • AI Whitepaper Data Scientists/Developers • neo4j.com/sandbox • neo4j.com/developer/ • community.neo4j.com [email protected] @amyhodler neo4j.com/ graph-algorithms-book