Neo4j Graph DB
{A step ahead towards non-RDBMS}
Presented By:
Mangal Dev
1
Agenda
• N0SQL
• Graph Database
• Neo4J
• Samples
Brew install neo4j
2
3
Interdependency and complexity of data.Informationconnectivity
Text
Documents
Hypertext
Feeds
Blogs
Wikis
UGC
Tagging
RDFa
Social networks
4
NOSQL Categories
 Key-Values Stores
Dynamo
Riak
 Column Family Stores
Big Table -> Google
Casandra -> Facebook
HBase -> Apache
 Document Based DB
CouchDB
MongoDB
 Graph Databases.
AllegroDB
FlockDB -> Twitter
InfiniteGraph
Neo4j
Sones
5
6
7
Why GraphDB ?
• Flexibility
No Normalization/De-normalization
Relational databases deal poorly with relationships.
• Agility
Schema free nature
Foreign Keys turns into dangling refrences
• Reciprocal Queries
• Usecases
SocialNetworks
BioInformatics
DataManagement
NetworkManagement
CloudManagement
GeoData
What does ‘R’ stand for in RDBMS ?
8
Graph Databases
9
Property Graph Model
WORKS_FOR
Name: Gracenote
first name: Neeti
first name: Mangal
late name: Dev
first name: Rajan
Name: Bangalore
10
11
Database full of linked nodes
Stores data as nodes and relationships
12
Some More Info..
• Widely Used
• Full ACID transactions
• Schema free, bottom-up data model design
• It’s a stable and in active developlment since
mid 2000
• High Performance, can cover upto 2 billion
nodes.
• Lot of Bindings
13
How do you query and work with
this graph database?
• Run it as
– Embedded
– Standalone
• Query it with
– Cyhper from Neo4j Browser
– Cypher from Java
– Rest API
– Rest API using Cypher query
– Programatically
– Gremlin
14
15
Cypher Query
• MATCH
– Matches the graph pattern in the real graph.
• WHERE
– Filters using predicates or anchors pattern elements.
• RETURN
– Returns and projects result data, also handles aggregation.
• ORDER BY
– Sorts the query result.
• SKIP/LIMIT
– Paginates the query result.
16
Go to Console…
• Neo4j start
• Go to url :
– http://localhost:7474/browser/
17
Cypher Basics
18
19
20
21
22
Core API
Neo4j Logical Architecture
REST API
JVM Language Bindings
Traversal Framework
Caches
Memory-Mapped (N)IO
Filesystem
Java Ruby Clojure…
Graph Matching
23
Graph Algorithms
• algorithm can have one of these values:
– shortestPath
– allSimplePaths
– allPaths
– dijkstra (optionally with cost_property and
default_cost parameters)
24
Graph Algorithms
• shortestPath()
MATCH p =
ShortestPath
( (keanu:Person)-[:KNOWS*]-(kevin:Person) )
WHERE
keanu.name="Keanu Reeves"
and
kevin.name = "Kevin Bacon"
RETURN
length(p)
25
Indexing a Graph?
• Graphs are their own indexes!
• But sometimes we want short-cuts to well-
known nodes
• Can do this in our own code
– Just keep a reference to any interesting nodes
e.g.
CREATE INDEX ON :MOVIE(title);
Don’t index every node!
26
Inbuilt Lucene Support
• Default index implementation for Neo4j
• Each index supports nodes or relationships
• Supports exact and regex-based matching
– Query(‘index_name’,’pattern’)
• Supports scoring
– Number of hits in the index for a given item
– Great for recommendations!
27
Scalability
• The HA component supports master-slave
replication
• But Scaling graphs are HARD.
• Acc to Emil -> Superb performance till some
billions of nodes
28
PEFORMANCE ANALYSIS
29
Pros and Cons
• Strengths
– Powerful data model
– Fast
• For connected data, can be many orders of magnitude
faster than RDBMS
• Weaknesses:
– Sharding
• Though they can scale reasonably well
• And for some domains you can shard too!
30
ENOUGH THEORY!!!
31
Sample Exercises
• Find all Persons who killed other in
Mahabharat.
• Find movies in which actor has acted along
with directed it.
• the five actors who have acted in the most
movies
• Shortest path between Kunti and Sehdev
• Find persons got killed by Kunti’s sons
32
References
• http://www.neo4j.org/learn/online_course
• http://www.infoq.com/articles/graph-nosql-
neo4j
• http://www.manning.com/partner/Neo4J_me
ap_ch01.pdf
33

More Related Content

PDF
Warg
PPTX
Scala and Spark are Ideal for Big Data
PPTX
ELK at LinkedIn - Kafka, scaling, lessons learned
PPTX
NATE-Central-Log
PDF
Polyglot persistence @ netflix (CDE Meetup)
PDF
Scala in Model-Driven development for Apparel Cloud Platform
PPTX
Real Time Data Processing With Spark Streaming, Node.js and Redis with Visual...
PDF
20160512 apache-spark-for-everyone
Warg
Scala and Spark are Ideal for Big Data
ELK at LinkedIn - Kafka, scaling, lessons learned
NATE-Central-Log
Polyglot persistence @ netflix (CDE Meetup)
Scala in Model-Driven development for Apparel Cloud Platform
Real Time Data Processing With Spark Streaming, Node.js and Redis with Visual...
20160512 apache-spark-for-everyone

What's hot (20)

PDF
Solr cloud the 'search first' nosql database extended deep dive
PDF
Introduction to apache spark
PDF
Archiving, E-Discovery, and Supervision with Spark and Hadoop with Jordan Volz
PPTX
seminar presentation on apache-spark
PDF
Introduction to df
PPTX
Scalable Object Storage with Apache CloudStack and Apache Hadoop
PDF
Apache Arrow: Present and Future @ ScaledML 2020
PDF
Apache Arrow Workshop at VLDB 2019 / BOSS Session
PDF
Building Realtime Data Pipelines with Kafka Connect and Spark Streaming
PDF
introduction to Neo4j (Tabriz Software Open Talks)
PDF
Building DSLs with Scala
PDF
Apache Spark MLlib's Past Trajectory and New Directions with Joseph Bradley
PDF
Scala and Spark are Ideal for Big Data - Data Science Pop-up Seattle
PPTX
Introduction to CosmosDB - Azure Bootcamp 2018
PDF
GraphQL API on a Serverless Environment
PPTX
Apache Spark in Industry
PDF
SAIS2018 - Fact Store At Netflix Scale
PDF
Anatomy of Data Frame API : A deep dive into Spark Data Frame API
PPTX
Riak perf wins
PPTX
Ruby to Scala in 9 weeks
Solr cloud the 'search first' nosql database extended deep dive
Introduction to apache spark
Archiving, E-Discovery, and Supervision with Spark and Hadoop with Jordan Volz
seminar presentation on apache-spark
Introduction to df
Scalable Object Storage with Apache CloudStack and Apache Hadoop
Apache Arrow: Present and Future @ ScaledML 2020
Apache Arrow Workshop at VLDB 2019 / BOSS Session
Building Realtime Data Pipelines with Kafka Connect and Spark Streaming
introduction to Neo4j (Tabriz Software Open Talks)
Building DSLs with Scala
Apache Spark MLlib's Past Trajectory and New Directions with Joseph Bradley
Scala and Spark are Ideal for Big Data - Data Science Pop-up Seattle
Introduction to CosmosDB - Azure Bootcamp 2018
GraphQL API on a Serverless Environment
Apache Spark in Industry
SAIS2018 - Fact Store At Netflix Scale
Anatomy of Data Frame API : A deep dive into Spark Data Frame API
Riak perf wins
Ruby to Scala in 9 weeks
Ad

Similar to Neo4j tms (20)

PPTX
NoSql Data Management
PDF
Spring one2gx2010 spring-nonrelational_data
PPTX
Graph Databases
PPT
NoSQL_Night
PDF
Spark Summit EU talk by Shay Nativ and Dvir Volk
PPT
Apache Cassandra training. Overview and Basics
PPT
Wmware NoSQL
PDF
Manuel Hurtado. Couchbase paradigma4oct
PPTX
No SQL- The Future Of Data Storage
PPTX
Why Functional Programming Is Important in Big Data Era
PPT
Processing Large Graphs
PPTX
Paris Data Geek - Spark Streaming
PPTX
mongodb_DS.pptx
PPTX
Yes sql08 inmemorydb
PDF
Apache Spark Presentation good for big data
PPT
Writing DSL's in Scala
PDF
Apache Spark Overview @ ferret
PPTX
NoSQL
PPTX
Nashville analytics summit aug9 no sql mike king dell v1.5
PPTX
Introducción a NoSQL
NoSql Data Management
Spring one2gx2010 spring-nonrelational_data
Graph Databases
NoSQL_Night
Spark Summit EU talk by Shay Nativ and Dvir Volk
Apache Cassandra training. Overview and Basics
Wmware NoSQL
Manuel Hurtado. Couchbase paradigma4oct
No SQL- The Future Of Data Storage
Why Functional Programming Is Important in Big Data Era
Processing Large Graphs
Paris Data Geek - Spark Streaming
mongodb_DS.pptx
Yes sql08 inmemorydb
Apache Spark Presentation good for big data
Writing DSL's in Scala
Apache Spark Overview @ ferret
NoSQL
Nashville analytics summit aug9 no sql mike king dell v1.5
Introducción a NoSQL
Ad

Recently uploaded (20)

PPTX
4Seller: The All-in-One Multi-Channel E-Commerce Management Platform for Glob...
PDF
MiniTool Power Data Recovery 12.6 Crack + Portable (Latest Version 2025)
PDF
E-Commerce Website Development Companyin india
PDF
Website Design & Development_ Professional Web Design Services.pdf
PPTX
Viber For Windows 25.7.1 Crack + Serial Keygen
PPTX
HackYourBrain__UtrechtJUG__11092025.pptx
PPTX
Chapter 1 - Transaction Processing and Mgt.pptx
PPTX
DevOpsDays Halifax 2025 - Building 10x Organizations Using Modern Productivit...
PDF
infoteam HELLAS company profile 2025 presentation
PPT
3.Software Design for software engineering
PDF
novaPDF Pro 11.9.482 Crack + License Key [Latest 2025]
DOC
UTEP毕业证学历认证,宾夕法尼亚克拉里恩大学毕业证未毕业
PPTX
Presentation by Samna Perveen And Subhan Afzal.pptx
PDF
Guide to Food Delivery App Development.pdf
PDF
SOFTWARE ENGINEERING Software Engineering (3rd Edition) by K.K. Aggarwal & Yo...
PPTX
Computer Software - Technology and Livelihood Education
PDF
CCleaner 6.39.11548 Crack 2025 License Key
PPTX
hospital managemt ,san.dckldnklcdnkdnkdnjadnjdjn
PDF
Practical Indispensable Project Management Tips for Delivering Successful Exp...
PPTX
Matchmaking for JVMs: How to Pick the Perfect GC Partner
4Seller: The All-in-One Multi-Channel E-Commerce Management Platform for Glob...
MiniTool Power Data Recovery 12.6 Crack + Portable (Latest Version 2025)
E-Commerce Website Development Companyin india
Website Design & Development_ Professional Web Design Services.pdf
Viber For Windows 25.7.1 Crack + Serial Keygen
HackYourBrain__UtrechtJUG__11092025.pptx
Chapter 1 - Transaction Processing and Mgt.pptx
DevOpsDays Halifax 2025 - Building 10x Organizations Using Modern Productivit...
infoteam HELLAS company profile 2025 presentation
3.Software Design for software engineering
novaPDF Pro 11.9.482 Crack + License Key [Latest 2025]
UTEP毕业证学历认证,宾夕法尼亚克拉里恩大学毕业证未毕业
Presentation by Samna Perveen And Subhan Afzal.pptx
Guide to Food Delivery App Development.pdf
SOFTWARE ENGINEERING Software Engineering (3rd Edition) by K.K. Aggarwal & Yo...
Computer Software - Technology and Livelihood Education
CCleaner 6.39.11548 Crack 2025 License Key
hospital managemt ,san.dckldnklcdnkdnkdnjadnjdjn
Practical Indispensable Project Management Tips for Delivering Successful Exp...
Matchmaking for JVMs: How to Pick the Perfect GC Partner

Neo4j tms

  • 1. Neo4j Graph DB {A step ahead towards non-RDBMS} Presented By: Mangal Dev 1
  • 2. Agenda • N0SQL • Graph Database • Neo4J • Samples Brew install neo4j 2
  • 3. 3
  • 4. Interdependency and complexity of data.Informationconnectivity Text Documents Hypertext Feeds Blogs Wikis UGC Tagging RDFa Social networks 4
  • 5. NOSQL Categories  Key-Values Stores Dynamo Riak  Column Family Stores Big Table -> Google Casandra -> Facebook HBase -> Apache  Document Based DB CouchDB MongoDB  Graph Databases. AllegroDB FlockDB -> Twitter InfiniteGraph Neo4j Sones 5
  • 6. 6
  • 7. 7
  • 8. Why GraphDB ? • Flexibility No Normalization/De-normalization Relational databases deal poorly with relationships. • Agility Schema free nature Foreign Keys turns into dangling refrences • Reciprocal Queries • Usecases SocialNetworks BioInformatics DataManagement NetworkManagement CloudManagement GeoData What does ‘R’ stand for in RDBMS ? 8
  • 10. Property Graph Model WORKS_FOR Name: Gracenote first name: Neeti first name: Mangal late name: Dev first name: Rajan Name: Bangalore 10
  • 11. 11
  • 12. Database full of linked nodes Stores data as nodes and relationships 12
  • 13. Some More Info.. • Widely Used • Full ACID transactions • Schema free, bottom-up data model design • It’s a stable and in active developlment since mid 2000 • High Performance, can cover upto 2 billion nodes. • Lot of Bindings 13
  • 14. How do you query and work with this graph database? • Run it as – Embedded – Standalone • Query it with – Cyhper from Neo4j Browser – Cypher from Java – Rest API – Rest API using Cypher query – Programatically – Gremlin 14
  • 15. 15
  • 16. Cypher Query • MATCH – Matches the graph pattern in the real graph. • WHERE – Filters using predicates or anchors pattern elements. • RETURN – Returns and projects result data, also handles aggregation. • ORDER BY – Sorts the query result. • SKIP/LIMIT – Paginates the query result. 16
  • 17. Go to Console… • Neo4j start • Go to url : – http://localhost:7474/browser/ 17
  • 19. 19
  • 20. 20
  • 21. 21
  • 22. 22
  • 23. Core API Neo4j Logical Architecture REST API JVM Language Bindings Traversal Framework Caches Memory-Mapped (N)IO Filesystem Java Ruby Clojure… Graph Matching 23
  • 24. Graph Algorithms • algorithm can have one of these values: – shortestPath – allSimplePaths – allPaths – dijkstra (optionally with cost_property and default_cost parameters) 24
  • 25. Graph Algorithms • shortestPath() MATCH p = ShortestPath ( (keanu:Person)-[:KNOWS*]-(kevin:Person) ) WHERE keanu.name="Keanu Reeves" and kevin.name = "Kevin Bacon" RETURN length(p) 25
  • 26. Indexing a Graph? • Graphs are their own indexes! • But sometimes we want short-cuts to well- known nodes • Can do this in our own code – Just keep a reference to any interesting nodes e.g. CREATE INDEX ON :MOVIE(title); Don’t index every node! 26
  • 27. Inbuilt Lucene Support • Default index implementation for Neo4j • Each index supports nodes or relationships • Supports exact and regex-based matching – Query(‘index_name’,’pattern’) • Supports scoring – Number of hits in the index for a given item – Great for recommendations! 27
  • 28. Scalability • The HA component supports master-slave replication • But Scaling graphs are HARD. • Acc to Emil -> Superb performance till some billions of nodes 28
  • 30. Pros and Cons • Strengths – Powerful data model – Fast • For connected data, can be many orders of magnitude faster than RDBMS • Weaknesses: – Sharding • Though they can scale reasonably well • And for some domains you can shard too! 30
  • 32. Sample Exercises • Find all Persons who killed other in Mahabharat. • Find movies in which actor has acted along with directed it. • the five actors who have acted in the most movies • Shortest path between Kunti and Sehdev • Find persons got killed by Kunti’s sons 32

Editor's Notes

  • #4: The exponential growth of the volume of data generated by users, systems and sensors, further accelerated by some big distributed systems like Amazon, Google and other cloud services. Every minute some systems are persisting around GBs of data . Quora Facebook example
  • #5: The increasing interdependency and complexity of data, accelerated by the Internet,  Web2.0, social networks and open and standardized access to data sources from a large number of different systems.
  • #9: individual, isolated, and discrete data items, but also the connections between them And while we could easily fit the discrete data in relational tables, the connected data was more challenging to store and tremendously slow to query.     the property graph model intuitive and easy to understand.