Neo4j, Inc. All rights reserved 2021
Knowledge Graphs & Graph Data Science:
More Context, Better Predictions
Dr. Maya Natarajan
Sr. Director, Product Marketing
[Knowledge Graphs]
maya.natarajan@neo4j.com
Dr. Alexander Jarasch
Life Sciences Technology Specialist
Board Member, HealthECCO
alexander.jarasch@neo4j.com
Data Without Relationships Has Very Little Context
It’s not the raw numbers, it’s the relationships behind them
The Quest for Human-Like Intelligence & Beyond
Knowledge Graph
• Graph Queries: Find the patterns you’re looking for in connected data.
• Graph Algorithms: Use unsupervised machine learning techniques to identify associations, anomalies, and trends.
• Graph-Native ML: Use embeddings to learn the features in your graph that you don’t even know are important yet. Train in-graph supervised ML models to predict links, labels, and missing data.
Building a Knowledge Graph
Data Graph: Dynamic Context
Knowledge Graph: Deep Dynamic Context
Semantics, Relationships
A New Generation of Complex Data
Small, Wide Data (Legacy Data: Legacy SQL Systems)
• One-to-Many Relationships Across Many Entities
Complex Data (Frozen Data: Data Lake Fact Tables)
• Many-to-Many Relationships
Hierarchical & Recursive Data (Hidden Data: Graph Data Science - Machine Reasoning)
• Nested Tree Structures
• Recursion (Self-Joins)
• Deep Hierarchies
• Link Inference (If C relates to A and A relates to E, then C must relate to E)
• Node Similarity
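The link inference rule on this slide (C relates to A, A relates to E, so C relates to E) can be sketched in a few lines. This is a minimal illustration on an in-memory edge list, not how Neo4j Graph Data Science implements it; the function name `infer_links` and the toy edges are assumptions made for the example.

```python
from collections import defaultdict


def infer_links(edges):
    """Return node pairs joined by a two-hop path but no direct edge."""
    neighbors = defaultdict(set)
    for a, b in edges:
        # Treat relationships as undirected for this sketch.
        neighbors[a].add(b)
        neighbors[b].add(a)

    inferred = set()
    for around in list(neighbors.values()):
        # Any two distinct neighbours of the same node are link candidates.
        for x in around:
            for y in around:
                if x < y and y not in neighbors[x]:
                    inferred.add((x, y))
    return inferred


# Toy data from the slide: C relates to A, A relates to E.
candidate_links = infer_links([("C", "A"), ("A", "E")])
print(candidate_links)
```

In practice such candidates are usually scored rather than asserted outright, which is where the similarity and link prediction algorithms later in the deck come in.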
Semantics
Controlled Vocabularies (increasing complexity): Synonym Rings, Taxonomy, Thesaurus, Ontology
Entity Resolution & Analysis
Classification & Tagging
Artificial Intelligence
Graph Queries
Hey, knowledge graph! Tell me “which genes regulate which pathways”.
MATCH path=(g:Gene)-[r:REGULATES]->(p:Pathway)
RETURN path
Graph Data Science Answers the Big Questions
Relationships are Powerful
But traditional approaches to data make it impossible to reveal and effectively use those connections as data sizes become large. Predictive signals get lost in big data noise.
Graph Data Science Uses Relationships to Answer Critical Questions
• What’s important?
• What’s unusual?
• What’s going to happen next?
Graph Algorithms
Pathfinding & Search
❏ Shortest Path
❏ Single-Source Shortest Path
❏ Delta Stepping
❏ All Pairs Shortest Path
❏ A* Shortest Path
❏ Yen’s K Shortest Path
❏ Minimum Weight Spanning Tree
❏ K-Spanning Tree (MST)
❏ Random Walk
❏ Breadth & Depth First Search
Centrality
❏ Degree Centrality
❏ Closeness Centrality
❏ Harmonic Centrality
❏ Betweenness Centrality & Approx.
❏ PageRank
❏ Personalized PageRank
❏ ArticleRank
❏ Eigenvector Centrality
❏ Hyperlink Induced Topic Search (HITS)
❏ Influence Maximization (Greedy, CELF)
Community Detection
❏ Triangle Count
❏ Local Clustering Coefficient
❏ Weakly Connected Components
❏ Strongly Connected Components
❏ Label Propagation
❏ Louvain Modularity
❏ K-1 Coloring
❏ Modularity Optimization
❏ Speaker Listener Label Propagation
❏ K-means Clustering
❏ Leiden Algorithm
❏ Conductance
Heuristic LP
❏ Adamic Adar
❏ Common Neighbors
❏ Preferential Attachment
❏ Resource Allocation
❏ Same Community
❏ Total Neighbors
Similarity
❏ Node Similarity
❏ Filtered Node Similarity
❏ K-Nearest Neighbors (KNN)
❏ Filtered KNN
❏ Functions: Pearson, Jaccard, Overlap, Cosine, Euclidean, Euclidean Distance
Graph Embeddings
❏ Node2Vec
❏ FastRP, FastRPExtended
❏ GraphSAGE
Machine Learning Pipelines
• Node classification: “What label should this node have?”
• Link prediction: “Is there a relationship between these nodes?”
◦ Labeled data: Pairs of nodes that are either linked or not
◦ Features: Pre-existing attributes, algorithm results, embeddings
• Property regression (new!): “What’s the value for this missing property?”
We discover the best model for you - you just supply the data!
Persist and Publish for Production
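To make the link prediction inputs concrete, the sketch below computes three of the heuristic scores named on the Graph Algorithms slide (common neighbors, preferential attachment, Adamic Adar) for one pair of nodes. It is an assumption-laden illustration in plain Python, not the GDS pipeline API; `pair_features` and the toy adjacency map are hypothetical names.

```python
import math


def pair_features(adj, u, v):
    """Heuristic link-prediction features for the node pair (u, v).

    adj maps each node to the set of its neighbours (undirected graph).
    """
    common = adj[u] & adj[v]
    return {
        "common_neighbors": len(common),
        "preferential_attachment": len(adj[u]) * len(adj[v]),
        # Adamic Adar: shared neighbours count more when they are rare hubs.
        "adamic_adar": sum(
            1.0 / math.log(len(adj[w])) for w in common if len(adj[w]) > 1
        ),
    }


# Hypothetical toy graph; "a" and "b" are not linked but share two neighbours.
toy_adj = {
    "a": {"c", "d"},
    "b": {"c", "d"},
    "c": {"a", "b"},
    "d": {"a", "b", "e"},
    "e": {"d"},
}
features = pair_features(toy_adj, "a", "b")
print(features)
```

A supervised model would be trained on such feature vectors for pairs labeled as linked or not linked, optionally combined with node attributes and embeddings.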
The Drug Lifecycle
DISCOVERY DEVELOPMENT COMMERCIALIZATION
⮚ Identify targets to pursue (traditional or by drug repurposing)
⮚ Management of compounds to identify sample swaps, HTS gaps, etc.
⮚ Identify the right investigator & patients for the right trial at the right time
⮚ Improve patient outcomes using patient journey analytics
⮚ Identify and connect patient complaints with raw material batches
Stages: Target Discovery, Hit Generation, Lead Identification, Lead Optimization, Animal Models, Clinical Trials, FDA/EMA Review & Approval, Post Approval Manufacturing
Increasing Volumes of Information
August 2020: 128,000
May 2022: 258,647
Building up the knowledge graph
Use Case 1
“Freedom-to-Operate on Hemostasis”
Interactive Browsing
“Freedom-to-Operate on Hemostasis”
• Based on the gene PF4, known to be involved in platelet hemostasis
• Browsing the patent landscape
• Browsing molecular pathways
• Understanding the mode of action
From Pathway to Patents to SysBio-Model in 30s
Use Case 2
Cross-Species Ontology
Graph Queries
Schematic approach
[Schematic: two chains of Ontology, Term, Protein, Transcript, and Gene nodes, connected by HAS, ASSOCIATED, CODES, HOMOLOG, and SYNONYM relationships]
Querying between Ontologies and Genes
Example: Equivalent HP-/MP-Term “Deafness”/“Hearing impairment” and orthologous genes
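// Assumption: each Term node stores its label in a `name` property (not shown on the slide).
MATCH (disease:Ontology)-[:HAS_TERM]->(hpTerm:Term)<-[:ASSOCIATED_WITH]-(p1:Protein)
      <-[:CODES*1..3]-(t1:Transcript)<-[:CODES*1..3]-(g1:Gene)
      -[:HOMOLOG]-(g2:Gene)
      -[:CODES*1..3]->(t2:Transcript)-[:CODES*1..3]->(p2:Protein)
      -[:ASSOCIATED_WITH]->(mpTerm:Term)<-[:HAS_TERM]-(phenotype:Ontology)
// The slide's "~=" and bare OR are pseudocode; a case-insensitive list match is one valid form.
WHERE toLower(hpTerm.name) IN ['deafness', 'hearing impairment']
RETURN *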
Use Case 3
Find The Most Relevant COVID19 Gene
Graph Algorithms
Find The Most Relevant COVID19 Gene
• Finding ACE2: the receptor the SARS-CoV-2 virus uses to enter the cell
• GDS PageRank
• Spike protein binds ACE2
• Target no. 1 for vaccines
• Understand the mode of action
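As a rough idea of what a PageRank run does when it surfaces a gene such as ACE2, here is a textbook power-iteration sketch. It is not the GDS implementation (which uses its own scaling and options), and the node names are placeholders rather than data from the talk.

```python
def pagerank(edges, damping=0.85, iterations=50):
    """Textbook PageRank by power iteration over directed (source, target) edges."""
    nodes = sorted({node for edge in edges for node in edge})
    outgoing = {node: [] for node in nodes}
    for source, target in edges:
        outgoing[source].append(target)

    rank = {node: 1.0 / len(nodes) for node in nodes}
    for _ in range(iterations):
        # Every node receives the same small "teleport" share first.
        updated = {node: (1.0 - damping) / len(nodes) for node in nodes}
        for node in nodes:
            # Nodes without outgoing edges spread their rank over all nodes.
            targets = outgoing[node] or nodes
            share = damping * rank[node] / len(targets)
            for target in targets:
                updated[target] += share
        rank = updated
    return rank


# Placeholder graph: three nodes point at "hub", which points back at one of them.
toy_edges = [("g1", "hub"), ("g2", "hub"), ("g3", "hub"), ("hub", "g1")]
ranks = pagerank(toy_edges)
top_node = max(ranks, key=ranks.get)
print(top_node, ranks)
```

The node that many others point to ends up with the highest score, which is the intuition behind ranking genes by how much of the surrounding knowledge graph refers to them.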
Use Case 4
Sub-Phenotyping Of Patient Populations
Graph Embeddings
Clinical Data from Patients
Gender, Age, BMI, diab.status, Sample coll, #samples, #used, surgery, disease, Histological diagnosis
Multi-omics Fingerprint (1) - Lipidomics
Multi-omics Fingerprint (2) - Transcriptomics
Transform Data with GDS - Fast Random Projections
CALL gds.fastRP.write(
  'patients',
  {
    embeddingDimension: 50,
    writeProperty: 'fastrp-embedding'
  }
)
YIELD nodePropertiesWritten
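For readers who want a feel for what the Fast Random Projection call computes, the following is a heavily simplified sketch of the idea: start from very sparse random vectors, repeatedly average neighbours' vectors, and sum the normalised results with per-iteration weights. It is an illustration only, not the GDS algorithm with all its options; the function name, the default weights and the toy graph are assumptions.

```python
import math
import random


def fast_random_projection(adj, dim=8, weights=(0.0, 1.0, 1.0), seed=42):
    """Simplified FastRP-style embedding; adj maps node -> list of neighbours."""
    rng = random.Random(seed)
    scale = math.sqrt(3.0)
    # Very sparse random start vectors: +scale, 0, -scale with p = 1/6, 2/3, 1/6.
    current = {
        node: [rng.choice((scale, 0.0, 0.0, 0.0, 0.0, -scale)) for _ in range(dim)]
        for node in adj
    }
    embedding = {node: [0.0] * dim for node in adj}
    for weight in weights:
        # Propagate: every node takes the mean of its neighbours' vectors.
        current = {
            node: [
                sum(current[m][i] for m in adj[node]) / max(len(adj[node]), 1)
                for i in range(dim)
            ]
            for node in adj
        }
        # Accumulate the L2-normalised intermediate vectors, weighted per iteration.
        for node in adj:
            norm = math.sqrt(sum(x * x for x in current[node])) or 1.0
            for i in range(dim):
                embedding[node][i] += weight * current[node][i] / norm
    return embedding


# Hypothetical toy graph: patients linked to the measurements they share.
toy_graph = {
    "patient_1": ["lipid_a", "lipid_b"],
    "patient_2": ["lipid_a", "lipid_b"],
    "patient_3": ["lipid_c"],
    "lipid_a": ["patient_1", "patient_2"],
    "lipid_b": ["patient_1", "patient_2"],
    "lipid_c": ["patient_3"],
}
vectors = fast_random_projection(toy_graph)
```

Patients with the same neighbourhood receive the same vector here, which is the property the later clustering step relies on.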
k-Nearest Neighbour Clustering (k=5)
Graph Algorithms
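The neighbour step behind this slide can be sketched as a brute-force k-nearest-neighbour search over embedding vectors using cosine similarity. This is a minimal stand-in, assuming small in-memory data; GDS uses an approximate algorithm, and the function names and toy vectors below are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def k_nearest_neighbours(embeddings, k=5):
    """Map every node to its k most similar other nodes, best first."""
    result = {}
    for node, vector in embeddings.items():
        scored = [
            (cosine(vector, other_vector), other)
            for other, other_vector in embeddings.items()
            if other != node
        ]
        scored.sort(reverse=True)
        result[node] = [other for _, other in scored[:k]]
    return result


# Hypothetical two-dimensional embeddings for four patients.
toy_embeddings = {
    "p1": [1.0, 0.0],
    "p2": [0.9, 0.1],
    "p3": [0.0, 1.0],
    "p4": [0.1, 0.9],
}
nearest = k_nearest_neighbours(toy_embeddings, k=1)
print(nearest)
```

Linking each patient to its nearest neighbours yields a similarity graph on which community detection can then group patients into sub-phenotypes.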
Connect Patient Data with a Knowledge Graph
Key Messages in this Presentation
In this presentation:
• Knowledge graphs enable the harnessing of relationships and provide deep, dynamic context important for analytics and machine learning
• Knowledge graphs are the gateway to powerful graph analytics
• Real-world pharma use cases:
◦ Use Case 1: Interactive Browsing
◦ Use Case 2: Graph Queries
◦ Use Case 3: Graph Algorithms
◦ Use Case 4: Graph Embeddings
Thank you!
Contact us:
maya.natarajan@neo4j.com
alexander.jarasch@neo4j.com

Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo4j at Pharma Data UK 2022
