SlideShare a Scribd company logo
A panel discussion with
Angela Bonifati, Alexandru Iosup, and Hannes Voigt
The Future Is Big Graphs:
A Community View on Graph
Processing Systems
Graphs are everywhere!
Graphs provide a universal and simple
blueprint for how to look at the world and
make sense of it.
Tech-driving applications
= data science + multi-hop relationships
Everyone*
uses graphs!
*
not yet :-(
2
Graph analytics at scale
Multi-hop analysis faces combinatorial scaling problem: Every step deeper into the
graph multiplies the number of choices and cases to consider
Dealing with this technical challenge is not the typical business interest of a user.
Which challenges are ahead of us to ready graph processing systems for the future?
Challenges to overcome: Abstractions, Ecosystems, Performance
3
[Source: https://upload.wikimedia.org/wikipedia/commons/thumb/b/b5/Maze.svg]
Challenges require expertise of many different fields
● Computer systems
● Data management theory
● Data management systems
● Data analytics
● Visualization
● Computer human interaction
● ...
4
[collaborate by ArmOkay from the Noun Project]
Community View on Graph Processing System
● Dagstuhl Seminar 19491
● Big Graph Processing Systems
● December 1–6, 2019
● CACM article summarize
the findings of the seminar
5
https://cacm.acm.org/magazines/2021/9/255040-the-future-is-big-graphs/
Angela Bonifati
● Professor at Lyon 1 University and
the Liris CNRS lab in France
● Leads the Database group at
Lyon 1
● Adjunct Professor at the
University of Waterloo in Canada
● Co-authored several publications in
major venues of the data
management field, two books and
an invited paper in ACM Sigmod
Record 2018
● Program Chair of ACM Sigmod
2022 and the Chair of the EDBT
Executive Committee.
6
Hannes Voigt
● Software engineer at Neo4j
● PhD in computer science from
TU Dresden, Germany
● Works on graph query language
design and standardization
● Chairs the Expert Group on
GQL in the US standardization
organization INCITS and is a
US national expert in ISO/IEC
JTC1 SC32 Working Group 3
Alexandru Iosup
● Full professor at Vrije
Universiteit Amsterdam
● Chair of the Massivizing
Computer Systems research
group at the VU and of the
SPEC-RG Cloud group
● His work in distributed systems
and ecosystems has received
prestigious recognition, including
membership in the (Young)
Royal Academy of Arts and
Sciences of the Netherlands and
the 2016 Netherlands ICT
Researcher of the Year award.
was a professor at the Institute of Computer Science at University of Tartu, Estonia.
He passed away on March 25, 2020, at the age of 40.
Sherif Sakr
Abstractions
● Abstractions are widely used in computer science
and are fundamental for graph-oriented data to
incorporate the user’s conceptual and
domain-specific perspectives
● Several graph data models do exist (RDF,
Property Graphs and their variants etc.)
● The choice of the data model depends on the final
application
● Instead of choosing one data model, we
propose to leverage a lattice of data models
that are interoperable through mappings
● The lattice seamlessly characterizes
expressive power of the various data
models
● Graph operations and graph queries could
benefit from logical formalisms as to create
a “universal” data model
● Graph algebras need to be defined
alongside the standard query language in
order to respond to the needs of graph
processing systems in the near future
7
Ecosystems
Graph processing ecosystems are complex, coupling many diverse systems
● Characterize complex workflows, combining heterogeneous queries and
algorithms, managing and processing diverse datasets
● Develop standards as common technical foundation, thereby increasing the
mobility of applications, tooling, developers, users, and stakeholders
● Define a reference architecture for big graph processing and the discussion
around the design, development, and deployment of cloud computing solutions
● Beyond scale-up vs. scale-out: Automatically decide how to run a workload, on
what kind of heterogeneous infrastructure, to meet given SLAs
● Cope with dynamic and complex graph data streams beyond simple filtering,
projections, and aggregation
8
Performance
● Invent simple yet general, faithful
performance metrics (better: just one)
● From algorithm to pipeline to
(remote) workflows
● Create tractable benchmarks that produce
credible and reproducible results
● Understand result sensitivity to HPAD,
and (link to ML) predict performance and
port the predictions across platforms
● Tame hardware and software complexity →
system-under-test performance is difficult
to isolate, performance variability to tame
● Processes to curate, audit, and evolve
benchmarks → possibly through LDBC
● Scale workload generators
● Cover combined operations, with temporal,
spatial, and streaming aspects
● Explore trade-off specialization vs.
portability and interoperability
● Explore extreme specialization
● Explore standardizing graph libraries and
query languages
● Explore integrating graph processing into
cross-domain tools, e.g., ML, sim, DB
● A memex for big graph processing
9
Q: Pre-relational models represented data as graphs, too. Remember the CODASYL network
model? Is this all a big step back?
10
Q: So far data science blossomed quite well without challenges of “abstractions” and
“ecosystems” and such. A number of Python scripts do the job just fine, aren’t they?
Q: The article seems to favor views of some specific tech companies. Are there solution do not
require solving these challenges?
Discussion on Other Important Topics (Q&A)
The Future of Graph Processing Systems
If we want to leverage the power of graphs in breath, mere hacking won’t cut it!
Everyone to mastering all the technical details of scaling graph analytics won’t
either.
Vision: Have a common graph analytics application ready in 1 hour, including:
● Deployment in a specific ecosystem,
● Deployment from the laptop + digital notebook,
● The performance engineering that allows it to operate at scale
14
A panel discussion with
Angela Bonifati, Alexandru Iosup, and Hannes Voigt
The Future Is Big Graphs:
A Community View on Graph
Processing Systems
https://cacm.acm.org/magazines/2021/
9/255040-the-future-is-big-graphs/

More Related Content

Similar to The Future is Big Graphs: A Community View on Graph Processing Systems (20)

PDF
BIG GRAPH: TOOLS, TECHNIQUES, ISSUES, CHALLENGES AND FUTURE DIRECTIONS
cscpconf
 
PPTX
Big Stream Processing Systems, Big Graphs
Petr Novotný
 
PDF
Ling liu part 02:big graph processing
jins0618
 
PDF
Ling liu part 01:big graph processing
jins0618
 
PDF
Graph Stream Processing : spinning fast, large scale, complex analytics
Paris Carbone
 
PPTX
Big data week 2018 - Graph Analytics on Big Data
Christos Hadjinikolis
 
PPTX
Graph processing
yeahjs
 
PDF
Trade-offs in Processing Large Graphs: Representations, Storage, Systems and ...
Deepak Ajwani
 
PDF
Ted Willke, Senior Principal Engineer & GM, Datacenter Group, Intel at MLconf SF
MLconf
 
PPTX
Distributed computing abstractions_data_science_6_june_2016_ver_0.4
Vijay Srinivas Agneeswaran, Ph.D
 
PDF
Graph Analysis Trends and Opportunities -- CMG Performance and Capacity 2014
Jason Riedy
 
PPTX
Large-scale Recommendation Systems on Just a PC
Aapo Kyrölä
 
PPTX
Hadoop World 2011: Hadoop and Graph Data Management: Challenges and Opportuni...
Cloudera, Inc.
 
PDF
Graph Realities
Connected Data World
 
PDF
Graphalytics: A big data benchmark for graph-processing platforms
Graph-TA
 
PDF
ScaleGraph - A High-Performance Library for Billion-Scale Graph Analytics
Toyotaro Suzumura
 
PPTX
Hadoop and Graph Data Management: Challenges and Opportunities
Daniel Abadi
 
PDF
Web-Scale Graph Analytics with Apache® Spark™
Databricks
 
PPTX
Apache Spark GraphX highlights.
Doug Needham
 
PDF
Big dataintegration rahm-part3Scalable and privacy-preserving data integratio...
ErhardRahm
 
BIG GRAPH: TOOLS, TECHNIQUES, ISSUES, CHALLENGES AND FUTURE DIRECTIONS
cscpconf
 
Big Stream Processing Systems, Big Graphs
Petr Novotný
 
Ling liu part 02:big graph processing
jins0618
 
Ling liu part 01:big graph processing
jins0618
 
Graph Stream Processing : spinning fast, large scale, complex analytics
Paris Carbone
 
Big data week 2018 - Graph Analytics on Big Data
Christos Hadjinikolis
 
Graph processing
yeahjs
 
Trade-offs in Processing Large Graphs: Representations, Storage, Systems and ...
Deepak Ajwani
 
Ted Willke, Senior Principal Engineer & GM, Datacenter Group, Intel at MLconf SF
MLconf
 
Distributed computing abstractions_data_science_6_june_2016_ver_0.4
Vijay Srinivas Agneeswaran, Ph.D
 
Graph Analysis Trends and Opportunities -- CMG Performance and Capacity 2014
Jason Riedy
 
Large-scale Recommendation Systems on Just a PC
Aapo Kyrölä
 
Hadoop World 2011: Hadoop and Graph Data Management: Challenges and Opportuni...
Cloudera, Inc.
 
Graph Realities
Connected Data World
 
Graphalytics: A big data benchmark for graph-processing platforms
Graph-TA
 
ScaleGraph - A High-Performance Library for Billion-Scale Graph Analytics
Toyotaro Suzumura
 
Hadoop and Graph Data Management: Challenges and Opportunities
Daniel Abadi
 
Web-Scale Graph Analytics with Apache® Spark™
Databricks
 
Apache Spark GraphX highlights.
Doug Needham
 
Big dataintegration rahm-part3Scalable and privacy-preserving data integratio...
ErhardRahm
 

More from Neo4j (20)

PDF
GraphSummit Singapore Master Deck - May 20, 2025
Neo4j
 
PPTX
Graphs & GraphRAG - Essential Ingredients for GenAI
Neo4j
 
PPTX
Neo4j Knowledge for Customer Experience.pptx
Neo4j
 
PPTX
GraphTalk New Zealand - The Art of The Possible.pptx
Neo4j
 
PDF
Neo4j: The Art of the Possible with Graph
Neo4j
 
PDF
Smarter Knowledge Graphs For Public Sector
Neo4j
 
PDF
GraphRAG and Knowledge Graphs Exploring AI's Future
Neo4j
 
PDF
Matinée GenAI & GraphRAG Paris - Décembre 24
Neo4j
 
PDF
ANZ Presentation: GraphSummit Melbourne 2024
Neo4j
 
PDF
Google Cloud Presentation GraphSummit Melbourne 2024: Building Generative AI ...
Neo4j
 
PDF
Telstra Presentation GraphSummit Melbourne: Optimising Business Outcomes with...
Neo4j
 
PDF
Hands-On GraphRAG Workshop: GraphSummit Melbourne 2024
Neo4j
 
PDF
Démonstration Digital Twin Building Wire Management
Neo4j
 
PDF
Swiss Life - Les graphes au service de la détection de fraude dans le domaine...
Neo4j
 
PDF
Démonstration Supply Chain - GraphTalk Paris
Neo4j
 
PDF
The Art of Possible - GraphTalk Paris Opening Session
Neo4j
 
PPTX
How Siemens bolstered supply chain resilience with graph-powered AI insights ...
Neo4j
 
PDF
Knowledge Graphs for AI-Ready Data and Enterprise Deployment - Gartner IT Sym...
Neo4j
 
PDF
Neo4j Graph Data Modelling Session - GraphTalk
Neo4j
 
PDF
Neo4j: The Art of Possible with Graph Technology
Neo4j
 
GraphSummit Singapore Master Deck - May 20, 2025
Neo4j
 
Graphs & GraphRAG - Essential Ingredients for GenAI
Neo4j
 
Neo4j Knowledge for Customer Experience.pptx
Neo4j
 
GraphTalk New Zealand - The Art of The Possible.pptx
Neo4j
 
Neo4j: The Art of the Possible with Graph
Neo4j
 
Smarter Knowledge Graphs For Public Sector
Neo4j
 
GraphRAG and Knowledge Graphs Exploring AI's Future
Neo4j
 
Matinée GenAI & GraphRAG Paris - Décembre 24
Neo4j
 
ANZ Presentation: GraphSummit Melbourne 2024
Neo4j
 
Google Cloud Presentation GraphSummit Melbourne 2024: Building Generative AI ...
Neo4j
 
Telstra Presentation GraphSummit Melbourne: Optimising Business Outcomes with...
Neo4j
 
Hands-On GraphRAG Workshop: GraphSummit Melbourne 2024
Neo4j
 
Démonstration Digital Twin Building Wire Management
Neo4j
 
Swiss Life - Les graphes au service de la détection de fraude dans le domaine...
Neo4j
 
Démonstration Supply Chain - GraphTalk Paris
Neo4j
 
The Art of Possible - GraphTalk Paris Opening Session
Neo4j
 
How Siemens bolstered supply chain resilience with graph-powered AI insights ...
Neo4j
 
Knowledge Graphs for AI-Ready Data and Enterprise Deployment - Gartner IT Sym...
Neo4j
 
Neo4j Graph Data Modelling Session - GraphTalk
Neo4j
 
Neo4j: The Art of Possible with Graph Technology
Neo4j
 
Ad

Recently uploaded (20)

PDF
Upgrading to z_OS V2R4 Part 01 of 02.pdf
Flavio787771
 
PPTX
MSP360 Backup Scheduling and Retention Best Practices.pptx
MSP360
 
PDF
Rethinking Security Operations - SOC Evolution Journey.pdf
Haris Chughtai
 
PPT
Interview paper part 3, It is based on Interview Prep
SoumyadeepGhosh39
 
PDF
HR agent at Mediq: Lessons learned on Agent Builder & Maestro by Tacstone Tec...
UiPathCommunity
 
PDF
Windsurf Meetup Ottawa 2025-07-12 - Planning Mode at Reliza.pdf
Pavel Shukhman
 
PDF
CloudStack GPU Integration - Rohit Yadav
ShapeBlue
 
PPTX
Building a Production-Ready Barts Health Secure Data Environment Tooling, Acc...
Barts Health
 
PDF
NewMind AI - Journal 100 Insights After The 100th Issue
NewMind AI
 
PDF
How Startups Are Growing Faster with App Developers in Australia.pdf
India App Developer
 
PPTX
UiPath Academic Alliance Educator Panels: Session 2 - Business Analyst Content
DianaGray10
 
PDF
SFWelly Summer 25 Release Highlights July 2025
Anna Loughnan Colquhoun
 
PDF
Log-Based Anomaly Detection: Enhancing System Reliability with Machine Learning
Mohammed BEKKOUCHE
 
PDF
Ampere Offers Energy-Efficient Future For AI And Cloud
ShapeBlue
 
PDF
Novus Safe Lite- What is Novus Safe Lite.pdf
Novus Hi-Tech
 
PDF
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
PPTX
✨Unleashing Collaboration: Salesforce Channels & Community Power in Patna!✨
SanjeetMishra29
 
PDF
Human-centred design in online workplace learning and relationship to engagem...
Tracy Tang
 
PDF
Meetup Kickoff & Welcome - Rohit Yadav, CSIUG Chairman
ShapeBlue
 
PDF
Women in Automation Presents: Reinventing Yourself — Bold Career Pivots That ...
DianaGray10
 
Upgrading to z_OS V2R4 Part 01 of 02.pdf
Flavio787771
 
MSP360 Backup Scheduling and Retention Best Practices.pptx
MSP360
 
Rethinking Security Operations - SOC Evolution Journey.pdf
Haris Chughtai
 
Interview paper part 3, It is based on Interview Prep
SoumyadeepGhosh39
 
HR agent at Mediq: Lessons learned on Agent Builder & Maestro by Tacstone Tec...
UiPathCommunity
 
Windsurf Meetup Ottawa 2025-07-12 - Planning Mode at Reliza.pdf
Pavel Shukhman
 
CloudStack GPU Integration - Rohit Yadav
ShapeBlue
 
Building a Production-Ready Barts Health Secure Data Environment Tooling, Acc...
Barts Health
 
NewMind AI - Journal 100 Insights After The 100th Issue
NewMind AI
 
How Startups Are Growing Faster with App Developers in Australia.pdf
India App Developer
 
UiPath Academic Alliance Educator Panels: Session 2 - Business Analyst Content
DianaGray10
 
SFWelly Summer 25 Release Highlights July 2025
Anna Loughnan Colquhoun
 
Log-Based Anomaly Detection: Enhancing System Reliability with Machine Learning
Mohammed BEKKOUCHE
 
Ampere Offers Energy-Efficient Future For AI And Cloud
ShapeBlue
 
Novus Safe Lite- What is Novus Safe Lite.pdf
Novus Hi-Tech
 
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
✨Unleashing Collaboration: Salesforce Channels & Community Power in Patna!✨
SanjeetMishra29
 
Human-centred design in online workplace learning and relationship to engagem...
Tracy Tang
 
Meetup Kickoff & Welcome - Rohit Yadav, CSIUG Chairman
ShapeBlue
 
Women in Automation Presents: Reinventing Yourself — Bold Career Pivots That ...
DianaGray10
 
Ad

The Future is Big Graphs: A Community View on Graph Processing Systems

  • 1. A panel discussion with Angela Bonifati, Alexandru Iosup, and Hannes Voigt The Future Is Big Graphs: A Community View on Graph Processing Systems
  • 2. Graphs are everywhere! Graphs provide a universal and simple blueprint for how to look at the world and make sense of it. Tech-driving applications = data science + multi-hop relationships Everyone* uses graphs! * not yet :-( 2
  • 3. Graph analytics at scale Multi-hop analysis faces combinatorial scaling problem: Every step deeper into the graph multiplies the number of choices and cases to consider Dealing with this technical challenge is not the typical business interest of a user. Which challenges are ahead of us to ready graph processing systems for the future? Challenges to overcome: Abstractions, Ecosystems, Performance 3 [Source: https://upload.wikimedia.org/wikipedia/commons/thumb/b/b5/Maze.svg]
  • 4. Challenges require expertise of many different fields ● Computer systems ● Data management theory ● Data management systems ● Data analytics ● Visualization ● Computer human interaction ● ... 4 [collaborate by ArmOkay from the Noun Project]
  • 5. Community View on Graph Processing System ● Dagstuhl Seminar 19491 ● Big Graph Processing Systems ● December 1–6, 2019 ● CACM article summarize the findings of the seminar 5 https://cacm.acm.org/magazines/2021/9/255040-the-future-is-big-graphs/
  • 6. Angela Bonifati ● Professor at Lyon 1 University and the Liris CNRS lab in France ● Leads the Database group at Lyon 1 ● Adjunct Professor at the University of Waterloo in Canada ● Co-authored several publications in major venues of the data management field, two books and an invited paper in ACM Sigmod Record 2018 ● Program Chair of ACM Sigmod 2022 and the Chair of the EDBT Executive Committee. 6 Hannes Voigt ● Software engineer at Neo4j ● PhD in computer science from TU Dresden, Germany ● Works on graph query language design and standardization ● Chairs the Expert Group on GQL in the US standardization organization INCITS and is a US national expert in ISO/IEC JTC1 SC32 Working Group 3 Alexandru Iosup ● Full professor at Vrije Universiteit Amsterdam ● Chair of the Massivizing Computer Systems research group at the VU and of the SPEC-RG Cloud group ● His work in distributed systems and ecosystems has received prestigious recognition, including membership in the (Young) Royal Academy of Arts and Sciences of the Netherlands and the 2016 Netherlands ICT Researcher of the Year award. was a professor at the Institute of Computer Science at University of Tartu, Estonia. He passed away on March 25, 2020, at the age of 40. Sherif Sakr
  • 7. Abstractions ● Abstractions are widely used in computer science and are fundamental for graph-oriented data to incorporate the user’s conceptual and domain-specific perspectives ● Several graph data models do exist (RDF, Property Graphs and their variants etc.) ● The choice of the data model depends on the final application ● Instead of choosing one data model, we propose to leverage a lattice of data models that are interoperable through mappings ● The lattice seamlessly characterizes expressive power of the various data models ● Graph operations and graph queries could benefit from logical formalisms as to create a “universal” data model ● Graph algebras need to be defined alongside the standard query language in order to respond to the needs of graph processing systems in the near future 7
  • 8. Ecosystems Graph processing ecosystems are complex, coupling many diverse systems ● Characterize complex workflows, combining heterogeneous queries and algorithms, managing and processing diverse datasets ● Develop standards as common technical foundation, thereby increasing the mobility of applications, tooling, developers, users, and stakeholders ● Define a reference architecture for big graph processing and the discussion around the design, development, and deployment of cloud computing solutions ● Beyond scale-up vs. scale-out: Automatically decide how to run a workload, on what kind of heterogeneous infrastructure, to meet given SLAs ● Cope with dynamic and complex graph data streams beyond simple filtering, projections, and aggregation 8
  • 9. Performance ● Invent simple yet general, faithful performance metrics (better: just one) ● From algorithm to pipeline to (remote) workflows ● Create tractable benchmarks that produce credible and reproducible results ● Understand result sensitivity to HPAD, and (link to ML) predict performance and port the predictions across platforms ● Tame hardware and software complexity → system-under-test performance is difficult to isolate, performance variability to tame ● Processes to curate, audit, and evolve benchmarks → possibly through LDBC ● Scale workload generators ● Cover combined operations, with temporal, spatial, and streaming aspects ● Explore trade-off specialization vs. portability and interoperability ● Explore extreme specialization ● Explore standardizing graph libraries and query languages ● Explore integrating graph processing into cross-domain tools, e.g., ML, sim, DB ● A memex for big graph processing 9
  • 10. Q: Pre-relational models represented data as graphs, too. Remember the CODASYL network model? Is this all a big step back? 10 Q: So far data science blossomed quite well without challenges of “abstractions” and “ecosystems” and such. A number of Python scripts do the job just fine, aren’t they? Q: The article seems to favor views of some specific tech companies. Are there solution do not require solving these challenges? Discussion on Other Important Topics (Q&A)
  • 11. The Future of Graph Processing Systems If we want to leverage the power of graphs in breath, mere hacking won’t cut it! Everyone to mastering all the technical details of scaling graph analytics won’t either. Vision: Have a common graph analytics application ready in 1 hour, including: ● Deployment in a specific ecosystem, ● Deployment from the laptop + digital notebook, ● The performance engineering that allows it to operate at scale 14
  • 12. A panel discussion with Angela Bonifati, Alexandru Iosup, and Hannes Voigt The Future Is Big Graphs: A Community View on Graph Processing Systems https://cacm.acm.org/magazines/2021/ 9/255040-the-future-is-big-graphs/