SlideShare a Scribd company logo
DataStax Enterprise: The Multi-Model Data Platform
1 Multi-Model Defined
2 The evolution of Models
3 Graph Databases Overview
4 The DataStax Approach to Graph Databases
5 DataStax in the Open Source Graph Community
6 Conclusion
2© 2015. All Rights Reserved.
Multi Model Defined
© 2015. All Rights Reserved. 3
Most database management systems are
organized around a single data model that
determines how data can be organized,
stored, and manipulated. In contrast, a multi-
model database is designed to support
multiple data models against a single,
integrated backend.
Source - https://en.wikipedia.org/wiki/Multi-model_database
The evolution of Models
© 2015. All Rights Reserved. 4
Application
Application
Application
(OLTP and OLAP)
Data Access Abstraction
RDBMS / SQL Polyglot
Persistence
Multi
Model
DSE Multi Model
© 2015. All Rights Reserved. 5
Multiple (polygot) Models exposed via
cohesive interaction mechanisms, (APIs)
for OLTP and OLAP workloads, (mixed workload)
with a unified persistence layer, (Apache Cassandra)
providing GR, always on characteristics
within a TCO efficient data base platform
for a variety of use cases
Cassandra
©2015 DataStax Confidential. Do not distribute without consent.
 6
DataStax Enterprise – Wide Row / MVs
C1 C2
MV1c1
MV1c2
Agg1c1
Agg2c1
©2015 DataStax Confidential. Do not distribute without consent.
 7
DataStax Enterprise - JSON
Inserting JSON data is easy
Reading JSON data is easy
Finding JSON errors is easy
Introduction to Graph Databases
© 2015. All Rights Reserved. 8
What is a Graph Database?
©2015 DataStax Confidential. Do not distribute without consent.
•  Store, manage and query highly connected data
•  Data stored as nodes (vertices), edges and properties to represent
a domain model
•  By explicitly embedding relationships in the data model, you store
a more logical business model using the natural data access
language Gremlin
•  Think of a graph as

a pre-joined

database
Choose the Right Model to Fit your Business Needs
© 2015 DataStax, All Rights Reserved. Company Confidential 10
DSE Wide Row
- Build and maintain models using
CQL’s DDL features
- Super-fast CRUD at scale with CQL
DML features
- Good option for denormalized
schemas with high-throughput
requirements
- Perfect fit for IoT applications that
require consuming enormous
amounts of data with specific
data retrieval requirements:
product catalogs, high-volume
messaging systems, collecting
and storing sensor data
DSE Graph
- Flexible schema that is easy to modify
and maintain with Gremlin
- Clearly maps business semantics to a
logical model for easy maintenance
and understanding
- Ideal model for highly-connected data
models
- Perfect fit for social-engagement models,
recommendation engines and IT
network / device management
- Update and query the graph in real-time
with easy-to-learn open-source
Gremlin language
- Good option for transitioning from slow
RDBMS 3NF models with lots of
JOINs
What is DataStax Enterprise (DSE) Graph? 
©2015 DataStax 
•  Highly scalable graph database for modern web, mobile, and IoT applications that
need to manage highly connected and heterogeneous data
•  Built-in support for real-time search, and analytic graph queries via tight integration
with the DSE platform
•  A property graph model native inside the DataStax product, engineered specifically
for Cassandra
•  Store & find relationships in data fast and easy on huge graphs
DSE Graph: Built-in Scalability 
©2015 DataStax 
•  Scale out Graph vs. Scale up only
•  Graph partitioning built on Cassandra’s scale-out architecture
•  Graph index structures integrated into Cassandra
•  Domain model maps more naturally to data model, allowing for greater
understanding between business and IT
•  Traverse millions of relationships in a short period of
time, faster than modeling the data in RDBMS
•  Flexible data model that can be easily adapted to
business changes
DSE Graph: Integrated Search, Analytics and Ops
©2015 DataStax 
•  Real-time traversals over complex-structured graph data
•  Integrates with DSE Search to mix search with traversal queries
•  Integrates with DSE Analytics and Spark to support OLAP and breadth-first
graph traversals
•  Iterative graph analytics like PageRank or other centrality measures
•  Reporting and aggregates over graph data
•  Integrated with DataStax OpsCenter
Graph Database Use Cases
© 2015. All Rights Reserved. 14
Additional Graph Use Cases
360 Degree View of Your Customer
•  Collect massive amounts of data point about your customer
•  Data collected from social networks, web analytics, digital ads, mobile devices, CRM
•  Bring heterogeneous customer data together into DSE Graph
•  Uncover buying patterns and customer behaviors
•  Graph becomes a master data hub for customer data
•  Use the graph customer hub to build better products
for your target customers
•  Keep the customers you already have with
customer intelligence
Customer
360 View
Social
Store
Sensors
Email
CRM
Mobile
Weblogs
Additional Graph Use Cases
IT Network and Device Management
•  Allows IT to monitor, manage and protect corporate networks and devices
(laptops, iPads, mobile phones, etc.)
•  Requires understanding of network topologies and relationships between
devices, interfaces, equipment, people, services …
•  A traditional RDBMS would require
expensive query-time joins
•  A graph model intrinsically knows how
to traverse the topology because the
relationships are already stored
•  This makes for quick & easy recognition
of problems, root cause analysis and
event correlation
DataStax in the Graph Open Source
Community
TinkerPop / Gremlin
© 2015. All Rights Reserved. 17
DataStax Role in TinkerPop
©2015 DataStax 
•  DataStax utilizes the TinkerPop framework for the DSE Graph
product
•  DataStax will contribute to the TinkerPop community and is
heavily invested in the success of the Gremlin language
•  DataStax will provide resource guides, documentation,
samples and training on building and querying graphs
with Gremlin, using DSE Graph as the graph engine
Gremlin
The open-source standard graph query language
•  DataStax contributes and supports the Apache TinkerPop community, along with the Gremlin Graph query
language
•  g.V.hasLabel('person').as('a').out('knows').as('b').select('a','b').by('age')
.by('age')
"for all people in the graph, give me the ages of the people on each end of a friendship relationship“
•  g.V.has('name','marko').out('knows').out('mother').outE('worksFor').
has('time',between(2001,2002)).inV.name



“what are the names of the places that marko's friends' mothers worked for from 2001 to 2002”

Deep traversal == multiple levels of query-time joins in RDBMS
Recommendation Query – RDBMS vs. Graph
©2015 DataStax 
SELECT TOP (5) [t14].[ProductName]
FROM (SELECT COUNT(*) AS [value],
[t13].[ProductName]
FROM [customers] AS [t0]
CROSS APPLY (SELECT [t9].[ProductName]
FROM [orders] AS [t1]
CROSS JOIN [order details] AS [t2]
INNER JOIN [products] AS [t3]
ON [t3].[ProductID] = [t2].[ProductID]
CROSS JOIN [order details] AS [t4]
INNER JOIN [orders] AS [t5]
ON [t5].[OrderID] = [t4].[OrderID]
LEFT JOIN [customers] AS [t6]
ON [t6].[CustomerID] = [t5].[CustomerID]
CROSS JOIN ([orders] AS [t7]
CROSS JOIN [order details] AS [t8]
INNER JOIN [products] AS [t9]
ON [t9].[ProductID] = [t8].[ProductID])
WHERE NOT EXISTS(SELECT NULL AS [EMPTY]
FROM [orders] AS [t10]
CROSS JOIN [order details] AS [t11]
INNER JOIN [products] AS [t12]
ON [t12].[ProductID] = [t11].[ProductID]
WHERE [t9].[ProductID] = [t12].[ProductID]
AND [t10].[CustomerID] = [t0].[CustomerID]
AND [t11].[OrderID] = [t10].[OrderID])
AND [t6].[CustomerID] <> [t0].[CustomerID]
AND [t1].[CustomerID] = [t0].[CustomerID]
AND [t2].[OrderID] = [t1].[OrderID]
AND [t4].[ProductID] = [t3].[ProductID]
AND [t7].[CustomerID] = [t6].[CustomerID]
AND [t8].[OrderID] = [t7].[OrderID]) AS [t13]
WHERE [t0].[CustomerID] = N'ALFKI'
GROUP BY [t13].[ProductName]) AS [t14]
ORDER BY [t14].[value] DESC
g.V('customerId','ALFKI').as('customer') 
.out('ordered').out('contains').out('is').as('products') 
.in('is').in('contains').in('ordered').except('customer') 
.out('ordered').out('contains').out('is').except('products') 
.groupCount().cap().orderMap(T.decr)[0..<5].productName
VS.
Thank you

More Related Content

PDF
DataStax: Making a Difference with Smart Analytics
DataStax Academy
 
PDF
DataStax: Steps to successfully implementing NoSQL in the enterprise
DataStax Academy
 
PPTX
StreamSet ETL tool
SwapnilSHampi
 
PDF
Building a MLOps Platform Around MLflow to Enable Model Productionalization i...
Databricks
 
PPTX
Snowflake + Power BI: Cloud Analytics for Everyone
Angel Abundez
 
PDF
Transforming Devon’s Data Pipeline with an Open Source Data Hub—Built on Data...
Databricks
 
PDF
The API Lie
SnapLogic
 
PPTX
How to Realize an Additional 270% ROI on Snowflake
AtScale
 
DataStax: Making a Difference with Smart Analytics
DataStax Academy
 
DataStax: Steps to successfully implementing NoSQL in the enterprise
DataStax Academy
 
StreamSet ETL tool
SwapnilSHampi
 
Building a MLOps Platform Around MLflow to Enable Model Productionalization i...
Databricks
 
Snowflake + Power BI: Cloud Analytics for Everyone
Angel Abundez
 
Transforming Devon’s Data Pipeline with an Open Source Data Hub—Built on Data...
Databricks
 
The API Lie
SnapLogic
 
How to Realize an Additional 270% ROI on Snowflake
AtScale
 

What's hot (17)

PPTX
Automate and Optimize Data Warehouse Migration to Snowflake
Impetus Technologies
 
PPTX
Master the Multi-Clustered Data Warehouse - Snowflake
Matillion
 
PPTX
Dive Into Data Lakes
Matillion
 
PDF
Introducing MLflow for End-to-End Machine Learning on Databricks
Databricks
 
PPTX
Altis Webinar: Use Cases For The Modern Data Platform
Altis Consulting
 
PPTX
Chug building a data lake in azure with spark and databricks
Brandon Berlinrut
 
PPTX
Leveraging cloud database connectors to automate analytics in alteryx
Grazitti Interactive
 
PPTX
Delivering digital transformation and business impact with io t, machine lear...
Robert Sanders
 
PPTX
From Events to Networks: Time Series Analysis on Scale
Dr. Mirko Kämpf
 
PPTX
A Design Approach To Drive Business Innovation Nov
Certus Solutions
 
PPTX
Aeris + Cassandra: An IOT Solution Helping Automakers Make the Connected Car ...
DataStax
 
PDF
Lambda Architecture in the Cloud with Azure Databricks with Andrei Varanovich
Databricks
 
PDF
About Pragmatic Works
MILL5
 
PDF
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Cathrine Wilhelmsen
 
PDF
The Power Of Snowflake for SAP BusinessObjects
Wiiisdom
 
PDF
Achieving Agility and Scale for Your Data Lake - Talend
Talend
 
PPTX
Altis AWS Snowflake Practice
SamanthaSwain7
 
Automate and Optimize Data Warehouse Migration to Snowflake
Impetus Technologies
 
Master the Multi-Clustered Data Warehouse - Snowflake
Matillion
 
Dive Into Data Lakes
Matillion
 
Introducing MLflow for End-to-End Machine Learning on Databricks
Databricks
 
Altis Webinar: Use Cases For The Modern Data Platform
Altis Consulting
 
Chug building a data lake in azure with spark and databricks
Brandon Berlinrut
 
Leveraging cloud database connectors to automate analytics in alteryx
Grazitti Interactive
 
Delivering digital transformation and business impact with io t, machine lear...
Robert Sanders
 
From Events to Networks: Time Series Analysis on Scale
Dr. Mirko Kämpf
 
A Design Approach To Drive Business Innovation Nov
Certus Solutions
 
Aeris + Cassandra: An IOT Solution Helping Automakers Make the Connected Car ...
DataStax
 
Lambda Architecture in the Cloud with Azure Databricks with Andrei Varanovich
Databricks
 
About Pragmatic Works
MILL5
 
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Cathrine Wilhelmsen
 
The Power Of Snowflake for SAP BusinessObjects
Wiiisdom
 
Achieving Agility and Scale for Your Data Lake - Talend
Talend
 
Altis AWS Snowflake Practice
SamanthaSwain7
 
Ad

Viewers also liked (20)

PDF
IBM Spark Technology Center: Real-time Advanced Analytics and Machine Learnin...
DataStax Academy
 
PDF
DataStax: The Whys of NoSQL
DataStax Academy
 
PDF
DataStax: Setting Your Database Management on Autopilot with OpsCenter
DataStax Academy
 
PDF
Petabridge: The New .NET Enterprise Stack
DataStax Academy
 
PDF
DataStax: Ramping up Cassandra QA
DataStax Academy
 
DOCX
cover pratikum 1
Tofan Tri Prasetyo
 
DOCX
Balance de energía- Síntesis de nanotubos
Mony Gonzalez
 
DOCX
Learn to get your startup funded
eTailing India
 
PDF
Interior Design Ain co.
Ain Architecture
 
PDF
Infosys Ltd: Performance Tuning - A Key to Successful Cassandra Migration
DataStax Academy
 
PDF
Journal Practicum AkSB
Ahmad Fadhil Rahim
 
PDF
Reltio: Powering Enterprise Data-driven Applications with Cassandra
DataStax Academy
 
PDF
Glassbeam: Ad-hoc Analytics on Internet of Complex Things with Apache Cassand...
DataStax Academy
 
PDF
DataStax: What's New in Apache TinkerPop - the Graph Computing Framework
DataStax Academy
 
PDF
Target: Escaping Disco-Era Data Modeling
DataStax Academy
 
PDF
Cisco UCS Integrated Infrastructure for Big Data with Cassandra
DataStax Academy
 
PDF
CAÇANDO DEUS SERVINDO AO HOMEM - EBOOK
Robson Lima
 
PDF
Roche Digital Pathology Vision
Roche Tissue Diagnostics
 
PPTX
Data warehouse implementation design for a Retail business
Arsalan Qadri
 
PDF
Cloud Customer Architecture for Hybrid Integration
Cloud Standards Customer Council
 
IBM Spark Technology Center: Real-time Advanced Analytics and Machine Learnin...
DataStax Academy
 
DataStax: The Whys of NoSQL
DataStax Academy
 
DataStax: Setting Your Database Management on Autopilot with OpsCenter
DataStax Academy
 
Petabridge: The New .NET Enterprise Stack
DataStax Academy
 
DataStax: Ramping up Cassandra QA
DataStax Academy
 
cover pratikum 1
Tofan Tri Prasetyo
 
Balance de energía- Síntesis de nanotubos
Mony Gonzalez
 
Learn to get your startup funded
eTailing India
 
Interior Design Ain co.
Ain Architecture
 
Infosys Ltd: Performance Tuning - A Key to Successful Cassandra Migration
DataStax Academy
 
Journal Practicum AkSB
Ahmad Fadhil Rahim
 
Reltio: Powering Enterprise Data-driven Applications with Cassandra
DataStax Academy
 
Glassbeam: Ad-hoc Analytics on Internet of Complex Things with Apache Cassand...
DataStax Academy
 
DataStax: What's New in Apache TinkerPop - the Graph Computing Framework
DataStax Academy
 
Target: Escaping Disco-Era Data Modeling
DataStax Academy
 
Cisco UCS Integrated Infrastructure for Big Data with Cassandra
DataStax Academy
 
CAÇANDO DEUS SERVINDO AO HOMEM - EBOOK
Robson Lima
 
Roche Digital Pathology Vision
Roche Tissue Diagnostics
 
Data warehouse implementation design for a Retail business
Arsalan Qadri
 
Cloud Customer Architecture for Hybrid Integration
Cloud Standards Customer Council
 
Ad

Similar to DataStax: Datastax Enterprise - The Multi-Model Platform (20)

PPTX
Introduction to DataStax Enterprise Graph Database
DataStax Academy
 
PPTX
Webinar: DataStax Enterprise 5.0 What’s New and How It’ll Make Your Life Easier
DataStax
 
PPTX
Managing Large Amounts of Data with Salesforce
Sense Corp
 
PDF
Trivadis TechEvent 2016 Introduction to DataStax Enterprise (DSE) Graph by Gu...
Trivadis
 
PPTX
Data Warehouse Modernization: Accelerating Time-To-Action
MapR Technologies
 
PDF
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Perficient, Inc.
 
PPTX
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
DataStax
 
PDF
ACdP Fiware.pdf
MASSAL3
 
PDF
Big data Question bank.pdf
Sitamarhi Institute of Technology
 
PPT
CouchBase The Complete NoSql Solution for Big Data
Debajani Mohanty
 
PPTX
Building a modern data warehouse
James Serra
 
PPTX
Master Meta Data
Digikrit
 
PPTX
Microsoft Fabric Introduction
James Serra
 
PDF
Data Virtualization: Introduction and Business Value (UK)
Denodo
 
DOC
Bigdata.sunil_6+yearsExp
bigdata sunil
 
PDF
Big Data Analytics Unit I CCS334 Syllabus
Sunanthini Rajkumar
 
DOC
Big Data Analyst at BankofAmerica
Ganaparthi Jagadeesh
 
PPTX
Sample PPT-LiveMasterClass Tableau.pptx
sarithaapgcr
 
PDF
LinkedInSaxoBankDataWorkbench
Sheetal Pratik
 
PDF
Virtualisation de données : Enjeux, Usages & Bénéfices
Denodo
 
Introduction to DataStax Enterprise Graph Database
DataStax Academy
 
Webinar: DataStax Enterprise 5.0 What’s New and How It’ll Make Your Life Easier
DataStax
 
Managing Large Amounts of Data with Salesforce
Sense Corp
 
Trivadis TechEvent 2016 Introduction to DataStax Enterprise (DSE) Graph by Gu...
Trivadis
 
Data Warehouse Modernization: Accelerating Time-To-Action
MapR Technologies
 
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Perficient, Inc.
 
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
DataStax
 
ACdP Fiware.pdf
MASSAL3
 
Big data Question bank.pdf
Sitamarhi Institute of Technology
 
CouchBase The Complete NoSql Solution for Big Data
Debajani Mohanty
 
Building a modern data warehouse
James Serra
 
Master Meta Data
Digikrit
 
Microsoft Fabric Introduction
James Serra
 
Data Virtualization: Introduction and Business Value (UK)
Denodo
 
Bigdata.sunil_6+yearsExp
bigdata sunil
 
Big Data Analytics Unit I CCS334 Syllabus
Sunanthini Rajkumar
 
Big Data Analyst at BankofAmerica
Ganaparthi Jagadeesh
 
Sample PPT-LiveMasterClass Tableau.pptx
sarithaapgcr
 
LinkedInSaxoBankDataWorkbench
Sheetal Pratik
 
Virtualisation de données : Enjeux, Usages & Bénéfices
Denodo
 

More from DataStax Academy (20)

PDF
Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
DataStax Academy
 
PPTX
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
DataStax Academy
 
PPTX
Cassandra on Docker @ Walmart Labs
DataStax Academy
 
PDF
Cassandra 3.0 Data Modeling
DataStax Academy
 
PPTX
Cassandra Adoption on Cisco UCS & Open stack
DataStax Academy
 
PDF
Data Modeling for Apache Cassandra
DataStax Academy
 
PDF
Coursera Cassandra Driver
DataStax Academy
 
PDF
Production Ready Cassandra
DataStax Academy
 
PDF
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
DataStax Academy
 
PPTX
Cassandra @ Sony: The good, the bad, and the ugly part 1
DataStax Academy
 
PPTX
Cassandra @ Sony: The good, the bad, and the ugly part 2
DataStax Academy
 
PDF
Standing Up Your First Cluster
DataStax Academy
 
PDF
Real Time Analytics with Dse
DataStax Academy
 
PDF
Introduction to Data Modeling with Apache Cassandra
DataStax Academy
 
PDF
Cassandra Core Concepts
DataStax Academy
 
PPTX
Enabling Search in your Cassandra Application with DataStax Enterprise
DataStax Academy
 
PPTX
Bad Habits Die Hard
DataStax Academy
 
PDF
Advanced Data Modeling with Apache Cassandra
DataStax Academy
 
PDF
Advanced Cassandra
DataStax Academy
 
PDF
Apache Cassandra and Drivers
DataStax Academy
 
Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
DataStax Academy
 
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
DataStax Academy
 
Cassandra on Docker @ Walmart Labs
DataStax Academy
 
Cassandra 3.0 Data Modeling
DataStax Academy
 
Cassandra Adoption on Cisco UCS & Open stack
DataStax Academy
 
Data Modeling for Apache Cassandra
DataStax Academy
 
Coursera Cassandra Driver
DataStax Academy
 
Production Ready Cassandra
DataStax Academy
 
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
DataStax Academy
 
Cassandra @ Sony: The good, the bad, and the ugly part 1
DataStax Academy
 
Cassandra @ Sony: The good, the bad, and the ugly part 2
DataStax Academy
 
Standing Up Your First Cluster
DataStax Academy
 
Real Time Analytics with Dse
DataStax Academy
 
Introduction to Data Modeling with Apache Cassandra
DataStax Academy
 
Cassandra Core Concepts
DataStax Academy
 
Enabling Search in your Cassandra Application with DataStax Enterprise
DataStax Academy
 
Bad Habits Die Hard
DataStax Academy
 
Advanced Data Modeling with Apache Cassandra
DataStax Academy
 
Advanced Cassandra
DataStax Academy
 
Apache Cassandra and Drivers
DataStax Academy
 

Recently uploaded (20)

PPTX
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
PDF
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
PPTX
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
PDF
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
PDF
A Strategic Analysis of the MVNO Wave in Emerging Markets.pdf
IPLOOK Networks
 
PPTX
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
PDF
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
PDF
Get More from Fiori Automation - What’s New, What Works, and What’s Next.pdf
Precisely
 
PDF
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
PDF
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
PDF
Economic Impact of Data Centres to the Malaysian Economy
flintglobalapac
 
PPTX
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
PDF
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PPTX
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
PPTX
The Future of AI & Machine Learning.pptx
pritsen4700
 
PDF
Doc9.....................................
SofiaCollazos
 
PPTX
Simple and concise overview about Quantum computing..pptx
mughal641
 
PDF
How ETL Control Logic Keeps Your Pipelines Safe and Reliable.pdf
Stryv Solutions Pvt. Ltd.
 
PDF
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
PDF
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
A Strategic Analysis of the MVNO Wave in Emerging Markets.pdf
IPLOOK Networks
 
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
Get More from Fiori Automation - What’s New, What Works, and What’s Next.pdf
Precisely
 
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
Economic Impact of Data Centres to the Malaysian Economy
flintglobalapac
 
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
The Future of AI & Machine Learning.pptx
pritsen4700
 
Doc9.....................................
SofiaCollazos
 
Simple and concise overview about Quantum computing..pptx
mughal641
 
How ETL Control Logic Keeps Your Pipelines Safe and Reliable.pdf
Stryv Solutions Pvt. Ltd.
 
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 

DataStax: Datastax Enterprise - The Multi-Model Platform

  • 1. DataStax Enterprise: The Multi-Model Data Platform
  • 2. 1 Multi-Model Defined 2 The evolution of Models 3 Graph Databases Overview 4 The DataStax Approach to Graph Databases 5 DataStax in the Open Source Graph Community 6 Conclusion 2© 2015. All Rights Reserved.
  • 3. Multi Model Defined © 2015. All Rights Reserved. 3 Most database management systems are organized around a single data model that determines how data can be organized, stored, and manipulated. In contrast, a multi- model database is designed to support multiple data models against a single, integrated backend. Source - https://en.wikipedia.org/wiki/Multi-model_database
  • 4. The evolution of Models © 2015. All Rights Reserved. 4 Application Application Application (OLTP and OLAP) Data Access Abstraction RDBMS / SQL Polyglot Persistence Multi Model
  • 5. DSE Multi Model © 2015. All Rights Reserved. 5 Multiple (polygot) Models exposed via cohesive interaction mechanisms, (APIs) for OLTP and OLAP workloads, (mixed workload) with a unified persistence layer, (Apache Cassandra) providing GR, always on characteristics within a TCO efficient data base platform for a variety of use cases Cassandra
  • 6. ©2015 DataStax Confidential. Do not distribute without consent. 6 DataStax Enterprise – Wide Row / MVs C1 C2 MV1c1 MV1c2 Agg1c1 Agg2c1
  • 7. ©2015 DataStax Confidential. Do not distribute without consent. 7 DataStax Enterprise - JSON Inserting JSON data is easy Reading JSON data is easy Finding JSON errors is easy
  • 8. Introduction to Graph Databases © 2015. All Rights Reserved. 8
  • 9. What is a Graph Database? ©2015 DataStax Confidential. Do not distribute without consent. •  Store, manage and query highly connected data •  Data stored as nodes (vertices), edges and properties to represent a domain model •  By explicitly embedding relationships in the data model, you store a more logical business model using the natural data access language Gremlin •  Think of a graph as
 a pre-joined
 database
  • 10. Choose the Right Model to Fit your Business Needs © 2015 DataStax, All Rights Reserved. Company Confidential 10 DSE Wide Row - Build and maintain models using CQL’s DDL features - Super-fast CRUD at scale with CQL DML features - Good option for denormalized schemas with high-throughput requirements - Perfect fit for IoT applications that require consuming enormous amounts of data with specific data retrieval requirements: product catalogs, high-volume messaging systems, collecting and storing sensor data DSE Graph - Flexible schema that is easy to modify and maintain with Gremlin - Clearly maps business semantics to a logical model for easy maintenance and understanding - Ideal model for highly-connected data models - Perfect fit for social-engagement models, recommendation engines and IT network / device management - Update and query the graph in real-time with easy-to-learn open-source Gremlin language - Good option for transitioning from slow RDBMS 3NF models with lots of JOINs
  • 11. What is DataStax Enterprise (DSE) Graph? ©2015 DataStax •  Highly scalable graph database for modern web, mobile, and IoT applications that need to manage highly connected and heterogeneous data •  Built-in support for real-time search, and analytic graph queries via tight integration with the DSE platform •  A property graph model native inside the DataStax product, engineered specifically for Cassandra •  Store & find relationships in data fast and easy on huge graphs
  • 12. DSE Graph: Built-in Scalability ©2015 DataStax •  Scale out Graph vs. Scale up only •  Graph partitioning built on Cassandra’s scale-out architecture •  Graph index structures integrated into Cassandra •  Domain model maps more naturally to data model, allowing for greater understanding between business and IT •  Traverse millions of relationships in a short period of time, faster than modeling the data in RDBMS •  Flexible data model that can be easily adapted to business changes
  • 13. DSE Graph: Integrated Search, Analytics and Ops ©2015 DataStax •  Real-time traversals over complex-structured graph data •  Integrates with DSE Search to mix search with traversal queries •  Integrates with DSE Analytics and Spark to support OLAP and breadth-first graph traversals •  Iterative graph analytics like PageRank or other centrality measures •  Reporting and aggregates over graph data •  Integrated with DataStax OpsCenter
  • 14. Graph Database Use Cases © 2015. All Rights Reserved. 14
  • 15. Additional Graph Use Cases 360 Degree View of Your Customer •  Collect massive amounts of data point about your customer •  Data collected from social networks, web analytics, digital ads, mobile devices, CRM •  Bring heterogeneous customer data together into DSE Graph •  Uncover buying patterns and customer behaviors •  Graph becomes a master data hub for customer data •  Use the graph customer hub to build better products for your target customers •  Keep the customers you already have with customer intelligence Customer 360 View Social Store Sensors Email CRM Mobile Weblogs
  • 16. Additional Graph Use Cases IT Network and Device Management •  Allows IT to monitor, manage and protect corporate networks and devices (laptops, iPads, mobile phones, etc.) •  Requires understanding of network topologies and relationships between devices, interfaces, equipment, people, services … •  A traditional RDBMS would require expensive query-time joins •  A graph model intrinsically knows how to traverse the topology because the relationships are already stored •  This makes for quick & easy recognition of problems, root cause analysis and event correlation
  • 17. DataStax in the Graph Open Source Community TinkerPop / Gremlin © 2015. All Rights Reserved. 17
  • 18. DataStax Role in TinkerPop ©2015 DataStax •  DataStax utilizes the TinkerPop framework for the DSE Graph product •  DataStax will contribute to the TinkerPop community and is heavily invested in the success of the Gremlin language •  DataStax will provide resource guides, documentation, samples and training on building and querying graphs with Gremlin, using DSE Graph as the graph engine
  • 19. Gremlin The open-source standard graph query language •  DataStax contributes and supports the Apache TinkerPop community, along with the Gremlin Graph query language •  g.V.hasLabel('person').as('a').out('knows').as('b').select('a','b').by('age') .by('age') "for all people in the graph, give me the ages of the people on each end of a friendship relationship“ •  g.V.has('name','marko').out('knows').out('mother').outE('worksFor'). has('time',between(2001,2002)).inV.name
 
 “what are the names of the places that marko's friends' mothers worked for from 2001 to 2002”
 Deep traversal == multiple levels of query-time joins in RDBMS
  • 20. Recommendation Query – RDBMS vs. Graph ©2015 DataStax SELECT TOP (5) [t14].[ProductName] FROM (SELECT COUNT(*) AS [value], [t13].[ProductName] FROM [customers] AS [t0] CROSS APPLY (SELECT [t9].[ProductName] FROM [orders] AS [t1] CROSS JOIN [order details] AS [t2] INNER JOIN [products] AS [t3] ON [t3].[ProductID] = [t2].[ProductID] CROSS JOIN [order details] AS [t4] INNER JOIN [orders] AS [t5] ON [t5].[OrderID] = [t4].[OrderID] LEFT JOIN [customers] AS [t6] ON [t6].[CustomerID] = [t5].[CustomerID] CROSS JOIN ([orders] AS [t7] CROSS JOIN [order details] AS [t8] INNER JOIN [products] AS [t9] ON [t9].[ProductID] = [t8].[ProductID]) WHERE NOT EXISTS(SELECT NULL AS [EMPTY] FROM [orders] AS [t10] CROSS JOIN [order details] AS [t11] INNER JOIN [products] AS [t12] ON [t12].[ProductID] = [t11].[ProductID] WHERE [t9].[ProductID] = [t12].[ProductID] AND [t10].[CustomerID] = [t0].[CustomerID] AND [t11].[OrderID] = [t10].[OrderID]) AND [t6].[CustomerID] <> [t0].[CustomerID] AND [t1].[CustomerID] = [t0].[CustomerID] AND [t2].[OrderID] = [t1].[OrderID] AND [t4].[ProductID] = [t3].[ProductID] AND [t7].[CustomerID] = [t6].[CustomerID] AND [t8].[OrderID] = [t7].[OrderID]) AS [t13] WHERE [t0].[CustomerID] = N'ALFKI' GROUP BY [t13].[ProductName]) AS [t14] ORDER BY [t14].[value] DESC g.V('customerId','ALFKI').as('customer') .out('ordered').out('contains').out('is').as('products') .in('is').in('contains').in('ordered').except('customer') .out('ordered').out('contains').out('is').except('products') .groupCount().cap().orderMap(T.decr)[0..<5].productName VS.