SlideShare a Scribd company logo
© COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Building on Multi-Model Databases
John Biedebach
Solution Director, MarkLogic
john.biedebach@marklogic.com
972-841-6060
SLIDE: 2 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Agenda
 How to get unified views across disparate data models and formats within a single database
 The benefits of a single product vs multi-product Multi-Model approach to data integration
 The importance of agility in data access and delivery through APIs, interfaces, and indexes
 How to scale a multi-model database while still providing ACID capabilities and security
 How to determine where a multi-model database fits in your existing architecture
SLIDE: 3 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
http://www.oreilly.com/data/free/building-
on-multi-model-databases.csp
MORE INFO
SLIDE: 4 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Hollerith Tabulator - 1890
SLIDE: 5 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
IBM Punch Card
SLIDE: 6 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Mainframes
SLIDE: 7 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
EF Codd
SLIDE: 8 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Relational Databases
 Relational Databases earn the name because prior to using them, you must define how the data
is “related”
SLIDE: 9 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
The Problem With the Relational Approach
The Business Changes,
The Requirements Change,
The Source Data Changes
1
Take a Current
State Snapshot
Design the New
Data Model
Perform ETL
Create the
Indexes
2
3
4
Build the
Application
5
Restart Process
6
SLIDE: 10 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Simple Transaction
SLIDE: 11 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Simple Transaction
Transaction
Detail
Transaction
Header
StoreProduct
SLIDE: 12 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Hierarchical Data Model
 Document-centric database
- Supports any-structured data via hierarchical data model
- Stores compressed binary trees
Document
Title
Author Section
Section
Section
Section
Section
First Last
Metadata
Transaction
Detail
Customer
Store
Product
NamePrice Phone
Address
SLIDE: 13 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Transaction
<root>
<header>
<date>2017-01-05</date>
<store>321</store>
</header>
<detail>
<item>Spaghetti Sauce</item>
<qty>1</qty>
<price unit=”usd”>8.00</price>
<item>Garlic</item>
...
</detail>
</root>
SLIDE: 14 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
The Document Model
 More natural and human-readable
 Heterogeneous data is okay (schema-
agnostic)
 Query across data harmoniously (e.g., search
for zip code, “94111”, returns both records)
 Group documents into collections (e.g., create
a collection for each source system)
 Insert/update/delete documents in a single
transaction – even if it changes the schema
{
"Customer_ID": 1001,
"Fname": "Paul",
"Lname": "Jackson",
"Phone": "415-555-1212",
"SSN": "123-45-6789",
"Addr": "123 Avenue ",
"City": "Someville",
"State": "CA",
"Zip": 94111
}
{ "Cust_ID" : 2001 ,
"Given_Name" : "Karen" ,
"Family_Name" : "Bender" ,
"Shipping_Address" : {
"Street" : "324 Some Road" ,
"City" : "San Francisco" ,
"State" : "CA" ,
"Postal" : "94111" ,
"Country" : "USA" } ,
"Billing_Address" : {
"Street" : "847 Another Ave" ,
"City" : "San Carlos" ,
"State" : "CA" ,
"Postal" : "94070" ,
"Country" : "USA" }
}
JSON
DOCUMENTS
SLIDE: 15 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
SLIDE: 16 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
SLIDE: 17 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
SLIDE: 18 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
SLIDE: 19 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
2
3
1
4
1
Ingest Data As-Is
2
Access the Data
4
Operationalize
3
Harmonize & Enrich the Data
SLIDE: 20 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
SLIDE: 21 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
2
3
1
4
1
Ingest Data As-Is
2
Access the Data
4
Operationalize
3
Harmonize & Enrich the Data
SLIDE: 22 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
SLIDE: 23 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Data
Lineage
SLIDE: 24 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Harmonization
SLIDE: 25 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Harmonization
SLIDE: 26 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Multi-Model
 Document (XML or JSON)
SLIDE: 27 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
SLIDE: 28 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
SLIDE: 29 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Multi-Model
 Document (XML or JSON)
 Geospatial
SLIDE: 30 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Joins
SLIDE: 31 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
RDF Triples
Subject Predicate Object
John IsMember PreSales
John IsSub Field_Org
John Field_OrgInfer:
SLIDE: 32 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
RDF Triples
Subject Predicate Object
John IsMember PreSales
Derek IsMember PreSales
Derek IsManager PreSales
SLIDE: 33 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
SLIDE: 34 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
SLIDE: 35 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Multi-Model
 Document (XML or JSON)
 Geospatial
 RDF Triples
SLIDE: 36 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
SLIDE: 37 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Multi-Model
 Document (XML or JSON)
 Geospatial
 RDF Triples (Semantic Data)
 Other Data:
- Social Media
- Videos, Photos
- Binary Documents
OTHER METHODS
REST APIs, Java Client API, Node.js Client API, Java / .NET XCC
JSON
XML
SEMANTIC
DATA
GEOSPATIAL
DATA
BINARY
Multi-Model
SLIDE: 38 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Agenda
 How to get unified views across disparate data models and formats within a single database
 The benefits of a single product vs multi-product Multi-Model approach to data integration
 The importance of agility in data access and delivery through APIs, interfaces, and indexes
 How to scale a multi-model database while still providing ACID capabilities and security
 How to determine where a multi-model database fits in your existing architecture
SLIDE: 39 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
JAVASCRIPT
XQUERY
SPARQL
SQL
TRANSACTIONAL
APPS
ANALYTICAL
APPS
Reference Architecture
JSON
XML
RDF
GEO
Ingest
BINARY
DATA INGESTION DATA STORAGE, MANAGEMENT, AND SEARCH APPLICATIONS
EVALUATION LAYER
(E-NODES)
DATA LAYER
(D-NODES)
INDEXES, SECURITY, MONITORING, MANAGEMENT
DATA ACCESS
DOWNSTREAM
SYSTEMS
REST API
JAVA API
NODE.JS API
SLIDE: 40 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Flexible Deployment
 Write the application once, run it anywhere
 Operate in the cloud
 Operate on-premises
CLOUD
PHYSICAL / VIRTUAL
DEPLOY
SLIDE: 41 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
ACID Transactions
 100% ACID Transactions, not eventually or
strongly consistent
 Enables transactional/operational apps (e.g.
e-commerce solutions)
 Implemented using Multi-Version Concurrency
Control (MVCC)
SLIDE: 42 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Certified, Granular Security
 RBAC at the sub-document level
 Automatic security indexing enforced at the
database level for ALL data access
 Common Criteria Security Certification
PROVENANCEAVAILABILITY
CERTIFIED
AUTHENTICITYINTEGRITY
CONFIDENTIALITY
Common Criteria
SLIDE: 43 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
BITEMPORAL CERTIFIED
SECURITY
ACID
TRANSACTIONS
FLEXIBLE
DEPLOYMENT
SCALABILITY
& ELASTICITY
HA/DR TIERED
STORAGE
Enterprise Requirements for Operational Systems
Capabilities Required
Must Never Lose Data
Must Deploy in Any Environment
Must Be Cost Effective
Must Be Secure
Must Address Industry Compliance Requirements
Must Scale Up & Down
SLIDE: 44 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Agenda
 How to get unified views across disparate data models and formats within a single database
 The benefits of a single product vs multi-product Multi-Model approach to data integration
 The importance of agility in data access and delivery through APIs, interfaces, and indexes
 How to scale a multi-model database while still providing ACID capabilities and security
 How to determine where a multi-model database fits in your existing architecture
SLIDE: 45 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Is Multi-Model Right For You?
Good Fits
 Disparate Data
 Changing Schema
 Multiple Data Types
 Hard to Model
 Multiple Targets
 Master Data Mgmt
Poor Fits
 Great Fit for Relational
 Heavy Reliance on
Traditional BI Tools
 Lack of Institutional
Willingness
SLIDE: 46 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Mythbusters
 Multi-Model is not secure
 Multi-Model is not good for transactions
 Multi-Model is hard
 Multi-Model does not run in the cloud
SLIDE: 47 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
http://www.oreilly.com/data/free/building-
on-multi-model-databases.csp
MORE INFO
© COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Building on Multi-Model Databases
John Biedebach
Solution Director, MarkLogic
john.biedebach@marklogic.com
972-841-6060

More Related Content

What's hot (17)

PDF
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DATAVERSITY
 
PDF
Graph Databases for Master Data Management
Neo4j
 
PDF
A Dynamic Data Catalog for Autonomy and Self-Service
Denodo
 
PDF
Why an AI-Powered Data Catalog Tool is Critical to Business Success
Informatica
 
PDF
Customer Keynote: Data Service and Security at an Enterprise Scale with Logic...
Denodo
 
PPTX
Slides: The Business Value of Data Modeling
DATAVERSITY
 
PPTX
You Need a Data Catalog. Do You Know Why?
Precisely
 
PDF
Data Lake Architecture – Modern Strategies & Approaches
DATAVERSITY
 
PPTX
Future of data
Steven Francia
 
PDF
Neo4j Solutions - Master Data Management
Caserta
 
PDF
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
Denodo
 
PDF
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo
 
PDF
Data Mesh at CMC Markets: Past, Present and Future
Lorenzo Nicora
 
PDF
Big Data Fabric: A Necessity For Any Successful Big Data Initiative
Denodo
 
PDF
Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...
DLT Solutions
 
PDF
6 Solution Patterns for Accelerating Self-Service BI, Cloud, Big Data, and Ot...
Denodo
 
PDF
Education Seminar: Self-service BI, Logical Data Warehouse and Data Lakes
Denodo
 
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DATAVERSITY
 
Graph Databases for Master Data Management
Neo4j
 
A Dynamic Data Catalog for Autonomy and Self-Service
Denodo
 
Why an AI-Powered Data Catalog Tool is Critical to Business Success
Informatica
 
Customer Keynote: Data Service and Security at an Enterprise Scale with Logic...
Denodo
 
Slides: The Business Value of Data Modeling
DATAVERSITY
 
You Need a Data Catalog. Do You Know Why?
Precisely
 
Data Lake Architecture – Modern Strategies & Approaches
DATAVERSITY
 
Future of data
Steven Francia
 
Neo4j Solutions - Master Data Management
Caserta
 
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
Denodo
 
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo
 
Data Mesh at CMC Markets: Past, Present and Future
Lorenzo Nicora
 
Big Data Fabric: A Necessity For Any Successful Big Data Initiative
Denodo
 
Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...
DLT Solutions
 
6 Solution Patterns for Accelerating Self-Service BI, Cloud, Big Data, and Ot...
Denodo
 
Education Seminar: Self-service BI, Logical Data Warehouse and Data Lakes
Denodo
 

Similar to Building on Multi-Model Databases (20)

PPTX
New Trends in Data Management in the Information Industries
Matt Turner
 
PDF
Data Con LA 2018 - Agile Integration Using an Enterprise Data Hub by Michael ...
Data Con LA
 
PDF
Cwin16 - Lyon - partner mark logic - the rise of nosql
Capgemini
 
PPTX
Northeastern DB Class Introduction to Marklogic NoSQL april 2016
Matt Turner
 
PDF
A New Way of Thinking About MDM
DATAVERSITY
 
PDF
Data-Centric Infrastructure for Agile Development
DATAVERSITY
 
PDF
The New Database Frontier: Harnessing the Cloud
Inside Analysis
 
PPTX
Mark Logic Information Analysis Trends Webinar
Dave Kellogg
 
PPTX
Operationalize Your Data and Lead Your Business Transformation
Matt Turner
 
PDF
The Value of Metadata
DATAVERSITY
 
PDF
Himss DC meet mark logic
Mohamad Thahir
 
PPTX
How to achieve a single view of critical business data with MDM
Precisely
 
PDF
Achieving a Single View of Business – Critical Data with Master Data Management
DATAVERSITY
 
PDF
Cwin16 tls-partner-mark logic-an innovation journey in manufacturing
Capgemini
 
PDF
A Data Integration Case Study - Avoid Creating a “Franken-Beast”
DATAVERSITY
 
PDF
Data Lake, Virtual Database, or Data Hub - How to Choose?
DATAVERSITY
 
PDF
ETL Practices for Better or Worse
Eric Sun
 
PPTX
MediaMath - Big Data Warehousing Meetup - 2/16/2016
SoryRawyer
 
PDF
Enterprise PHP Architecture through Design Patterns and Modularization (Midwe...
Aaron Saray
 
PPTX
Smart Content Summit - Unlocking Content With Semantics and Metadata
Matt Turner
 
New Trends in Data Management in the Information Industries
Matt Turner
 
Data Con LA 2018 - Agile Integration Using an Enterprise Data Hub by Michael ...
Data Con LA
 
Cwin16 - Lyon - partner mark logic - the rise of nosql
Capgemini
 
Northeastern DB Class Introduction to Marklogic NoSQL april 2016
Matt Turner
 
A New Way of Thinking About MDM
DATAVERSITY
 
Data-Centric Infrastructure for Agile Development
DATAVERSITY
 
The New Database Frontier: Harnessing the Cloud
Inside Analysis
 
Mark Logic Information Analysis Trends Webinar
Dave Kellogg
 
Operationalize Your Data and Lead Your Business Transformation
Matt Turner
 
The Value of Metadata
DATAVERSITY
 
Himss DC meet mark logic
Mohamad Thahir
 
How to achieve a single view of critical business data with MDM
Precisely
 
Achieving a Single View of Business – Critical Data with Master Data Management
DATAVERSITY
 
Cwin16 tls-partner-mark logic-an innovation journey in manufacturing
Capgemini
 
A Data Integration Case Study - Avoid Creating a “Franken-Beast”
DATAVERSITY
 
Data Lake, Virtual Database, or Data Hub - How to Choose?
DATAVERSITY
 
ETL Practices for Better or Worse
Eric Sun
 
MediaMath - Big Data Warehousing Meetup - 2/16/2016
SoryRawyer
 
Enterprise PHP Architecture through Design Patterns and Modularization (Midwe...
Aaron Saray
 
Smart Content Summit - Unlocking Content With Semantics and Metadata
Matt Turner
 
Ad

More from DATAVERSITY (20)

PDF
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
DATAVERSITY
 
PDF
Data at the Speed of Business with Data Mastering and Governance
DATAVERSITY
 
PDF
Exploring Levels of Data Literacy
DATAVERSITY
 
PDF
Building a Data Strategy – Practical Steps for Aligning with Business Goals
DATAVERSITY
 
PDF
Make Data Work for You
DATAVERSITY
 
PDF
Data Catalogs Are the Answer – What is the Question?
DATAVERSITY
 
PDF
Data Catalogs Are the Answer – What Is the Question?
DATAVERSITY
 
PDF
Data Modeling Fundamentals
DATAVERSITY
 
PDF
Showing ROI for Your Analytic Project
DATAVERSITY
 
PDF
How a Semantic Layer Makes Data Mesh Work at Scale
DATAVERSITY
 
PDF
Is Enterprise Data Literacy Possible?
DATAVERSITY
 
PDF
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
DATAVERSITY
 
PDF
Emerging Trends in Data Architecture – What’s the Next Big Thing?
DATAVERSITY
 
PDF
Data Governance Trends - A Look Backwards and Forwards
DATAVERSITY
 
PDF
Data Governance Trends and Best Practices To Implement Today
DATAVERSITY
 
PDF
2023 Trends in Enterprise Analytics
DATAVERSITY
 
PDF
Data Strategy Best Practices
DATAVERSITY
 
PDF
Who Should Own Data Governance – IT or Business?
DATAVERSITY
 
PDF
Data Management Best Practices
DATAVERSITY
 
PDF
MLOps – Applying DevOps to Competitive Advantage
DATAVERSITY
 
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
DATAVERSITY
 
Data at the Speed of Business with Data Mastering and Governance
DATAVERSITY
 
Exploring Levels of Data Literacy
DATAVERSITY
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
DATAVERSITY
 
Make Data Work for You
DATAVERSITY
 
Data Catalogs Are the Answer – What is the Question?
DATAVERSITY
 
Data Catalogs Are the Answer – What Is the Question?
DATAVERSITY
 
Data Modeling Fundamentals
DATAVERSITY
 
Showing ROI for Your Analytic Project
DATAVERSITY
 
How a Semantic Layer Makes Data Mesh Work at Scale
DATAVERSITY
 
Is Enterprise Data Literacy Possible?
DATAVERSITY
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
DATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
DATAVERSITY
 
Data Governance Trends - A Look Backwards and Forwards
DATAVERSITY
 
Data Governance Trends and Best Practices To Implement Today
DATAVERSITY
 
2023 Trends in Enterprise Analytics
DATAVERSITY
 
Data Strategy Best Practices
DATAVERSITY
 
Who Should Own Data Governance – IT or Business?
DATAVERSITY
 
Data Management Best Practices
DATAVERSITY
 
MLOps – Applying DevOps to Competitive Advantage
DATAVERSITY
 
Ad

Recently uploaded (20)

PPT
Interview paper part 3, It is based on Interview Prep
SoumyadeepGhosh39
 
PDF
HCIP-Data Center Facility Deployment V2.0 Training Material (Without Remarks ...
mcastillo49
 
PDF
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
PDF
Empower Inclusion Through Accessible Java Applications
Ana-Maria Mihalceanu
 
PDF
Chris Elwell Woburn, MA - Passionate About IT Innovation
Chris Elwell Woburn, MA
 
PPTX
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
PDF
Predicting the unpredictable: re-engineering recommendation algorithms for fr...
Speck&Tech
 
PPTX
Q2 Leading a Tableau User Group - Onboarding
lward7
 
PDF
Complete JavaScript Notes: From Basics to Advanced Concepts.pdf
haydendavispro
 
PDF
Smart Air Quality Monitoring with Serrax AQM190 LITE
SERRAX TECHNOLOGIES LLP
 
PDF
New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
PDF
Log-Based Anomaly Detection: Enhancing System Reliability with Machine Learning
Mohammed BEKKOUCHE
 
PPTX
✨Unleashing Collaboration: Salesforce Channels & Community Power in Patna!✨
SanjeetMishra29
 
PDF
NewMind AI - Journal 100 Insights After The 100th Issue
NewMind AI
 
PPTX
Top iOS App Development Company in the USA for Innovative Apps
SynapseIndia
 
PPTX
UiPath Academic Alliance Educator Panels: Session 2 - Business Analyst Content
DianaGray10
 
PDF
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
PDF
Learn Computer Forensics, Second Edition
AnuraShantha7
 
PPTX
WooCommerce Workshop: Bring Your Laptop
Laura Hartwig
 
PDF
CIFDAQ Market Insights for July 7th 2025
CIFDAQ
 
Interview paper part 3, It is based on Interview Prep
SoumyadeepGhosh39
 
HCIP-Data Center Facility Deployment V2.0 Training Material (Without Remarks ...
mcastillo49
 
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
Empower Inclusion Through Accessible Java Applications
Ana-Maria Mihalceanu
 
Chris Elwell Woburn, MA - Passionate About IT Innovation
Chris Elwell Woburn, MA
 
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
Predicting the unpredictable: re-engineering recommendation algorithms for fr...
Speck&Tech
 
Q2 Leading a Tableau User Group - Onboarding
lward7
 
Complete JavaScript Notes: From Basics to Advanced Concepts.pdf
haydendavispro
 
Smart Air Quality Monitoring with Serrax AQM190 LITE
SERRAX TECHNOLOGIES LLP
 
New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
Log-Based Anomaly Detection: Enhancing System Reliability with Machine Learning
Mohammed BEKKOUCHE
 
✨Unleashing Collaboration: Salesforce Channels & Community Power in Patna!✨
SanjeetMishra29
 
NewMind AI - Journal 100 Insights After The 100th Issue
NewMind AI
 
Top iOS App Development Company in the USA for Innovative Apps
SynapseIndia
 
UiPath Academic Alliance Educator Panels: Session 2 - Business Analyst Content
DianaGray10
 
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
Learn Computer Forensics, Second Edition
AnuraShantha7
 
WooCommerce Workshop: Bring Your Laptop
Laura Hartwig
 
CIFDAQ Market Insights for July 7th 2025
CIFDAQ
 

Building on Multi-Model Databases

  • 1. © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Building on Multi-Model Databases John Biedebach Solution Director, MarkLogic [email protected] 972-841-6060
  • 2. SLIDE: 2 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Agenda  How to get unified views across disparate data models and formats within a single database  The benefits of a single product vs multi-product Multi-Model approach to data integration  The importance of agility in data access and delivery through APIs, interfaces, and indexes  How to scale a multi-model database while still providing ACID capabilities and security  How to determine where a multi-model database fits in your existing architecture
  • 3. SLIDE: 3 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. http://www.oreilly.com/data/free/building- on-multi-model-databases.csp MORE INFO
  • 4. SLIDE: 4 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Hollerith Tabulator - 1890
  • 5. SLIDE: 5 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. IBM Punch Card
  • 6. SLIDE: 6 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Mainframes
  • 7. SLIDE: 7 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. EF Codd
  • 8. SLIDE: 8 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Relational Databases  Relational Databases earn the name because prior to using them, you must define how the data is “related”
  • 9. SLIDE: 9 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. The Problem With the Relational Approach The Business Changes, The Requirements Change, The Source Data Changes 1 Take a Current State Snapshot Design the New Data Model Perform ETL Create the Indexes 2 3 4 Build the Application 5 Restart Process 6
  • 10. SLIDE: 10 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Simple Transaction
  • 11. SLIDE: 11 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Simple Transaction Transaction Detail Transaction Header StoreProduct
  • 12. SLIDE: 12 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Hierarchical Data Model  Document-centric database - Supports any-structured data via hierarchical data model - Stores compressed binary trees Document Title Author Section Section Section Section Section First Last Metadata Transaction Detail Customer Store Product NamePrice Phone Address
  • 13. SLIDE: 13 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Transaction <root> <header> <date>2017-01-05</date> <store>321</store> </header> <detail> <item>Spaghetti Sauce</item> <qty>1</qty> <price unit=”usd”>8.00</price> <item>Garlic</item> ... </detail> </root>
  • 14. SLIDE: 14 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. The Document Model  More natural and human-readable  Heterogeneous data is okay (schema- agnostic)  Query across data harmoniously (e.g., search for zip code, “94111”, returns both records)  Group documents into collections (e.g., create a collection for each source system)  Insert/update/delete documents in a single transaction – even if it changes the schema { "Customer_ID": 1001, "Fname": "Paul", "Lname": "Jackson", "Phone": "415-555-1212", "SSN": "123-45-6789", "Addr": "123 Avenue ", "City": "Someville", "State": "CA", "Zip": 94111 } { "Cust_ID" : 2001 , "Given_Name" : "Karen" , "Family_Name" : "Bender" , "Shipping_Address" : { "Street" : "324 Some Road" , "City" : "San Francisco" , "State" : "CA" , "Postal" : "94111" , "Country" : "USA" } , "Billing_Address" : { "Street" : "847 Another Ave" , "City" : "San Carlos" , "State" : "CA" , "Postal" : "94070" , "Country" : "USA" } } JSON DOCUMENTS
  • 15. SLIDE: 15 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
  • 16. SLIDE: 16 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
  • 17. SLIDE: 17 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
  • 18. SLIDE: 18 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
  • 19. SLIDE: 19 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. 2 3 1 4 1 Ingest Data As-Is 2 Access the Data 4 Operationalize 3 Harmonize & Enrich the Data
  • 20. SLIDE: 20 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
  • 21. SLIDE: 21 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. 2 3 1 4 1 Ingest Data As-Is 2 Access the Data 4 Operationalize 3 Harmonize & Enrich the Data
  • 22. SLIDE: 22 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
  • 23. SLIDE: 23 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Data Lineage
  • 24. SLIDE: 24 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Harmonization
  • 25. SLIDE: 25 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Harmonization
  • 26. SLIDE: 26 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Multi-Model  Document (XML or JSON)
  • 27. SLIDE: 27 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
  • 28. SLIDE: 28 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
  • 29. SLIDE: 29 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Multi-Model  Document (XML or JSON)  Geospatial
  • 30. SLIDE: 30 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Joins
  • 31. SLIDE: 31 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. RDF Triples Subject Predicate Object John IsMember PreSales John IsSub Field_Org John Field_OrgInfer:
  • 32. SLIDE: 32 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. RDF Triples Subject Predicate Object John IsMember PreSales Derek IsMember PreSales Derek IsManager PreSales
  • 33. SLIDE: 33 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
  • 34. SLIDE: 34 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
  • 35. SLIDE: 35 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Multi-Model  Document (XML or JSON)  Geospatial  RDF Triples
  • 36. SLIDE: 36 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
  • 37. SLIDE: 37 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Multi-Model  Document (XML or JSON)  Geospatial  RDF Triples (Semantic Data)  Other Data: - Social Media - Videos, Photos - Binary Documents OTHER METHODS REST APIs, Java Client API, Node.js Client API, Java / .NET XCC JSON XML SEMANTIC DATA GEOSPATIAL DATA BINARY Multi-Model
  • 38. SLIDE: 38 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Agenda  How to get unified views across disparate data models and formats within a single database  The benefits of a single product vs multi-product Multi-Model approach to data integration  The importance of agility in data access and delivery through APIs, interfaces, and indexes  How to scale a multi-model database while still providing ACID capabilities and security  How to determine where a multi-model database fits in your existing architecture
  • 39. SLIDE: 39 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. JAVASCRIPT XQUERY SPARQL SQL TRANSACTIONAL APPS ANALYTICAL APPS Reference Architecture JSON XML RDF GEO Ingest BINARY DATA INGESTION DATA STORAGE, MANAGEMENT, AND SEARCH APPLICATIONS EVALUATION LAYER (E-NODES) DATA LAYER (D-NODES) INDEXES, SECURITY, MONITORING, MANAGEMENT DATA ACCESS DOWNSTREAM SYSTEMS REST API JAVA API NODE.JS API
  • 40. SLIDE: 40 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Flexible Deployment  Write the application once, run it anywhere  Operate in the cloud  Operate on-premises CLOUD PHYSICAL / VIRTUAL DEPLOY
  • 41. SLIDE: 41 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. ACID Transactions  100% ACID Transactions, not eventually or strongly consistent  Enables transactional/operational apps (e.g. e-commerce solutions)  Implemented using Multi-Version Concurrency Control (MVCC)
  • 42. SLIDE: 42 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Certified, Granular Security  RBAC at the sub-document level  Automatic security indexing enforced at the database level for ALL data access  Common Criteria Security Certification PROVENANCEAVAILABILITY CERTIFIED AUTHENTICITYINTEGRITY CONFIDENTIALITY Common Criteria
  • 43. SLIDE: 43 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. BITEMPORAL CERTIFIED SECURITY ACID TRANSACTIONS FLEXIBLE DEPLOYMENT SCALABILITY & ELASTICITY HA/DR TIERED STORAGE Enterprise Requirements for Operational Systems Capabilities Required Must Never Lose Data Must Deploy in Any Environment Must Be Cost Effective Must Be Secure Must Address Industry Compliance Requirements Must Scale Up & Down
  • 44. SLIDE: 44 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Agenda  How to get unified views across disparate data models and formats within a single database  The benefits of a single product vs multi-product Multi-Model approach to data integration  The importance of agility in data access and delivery through APIs, interfaces, and indexes  How to scale a multi-model database while still providing ACID capabilities and security  How to determine where a multi-model database fits in your existing architecture
  • 45. SLIDE: 45 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Is Multi-Model Right For You? Good Fits  Disparate Data  Changing Schema  Multiple Data Types  Hard to Model  Multiple Targets  Master Data Mgmt Poor Fits  Great Fit for Relational  Heavy Reliance on Traditional BI Tools  Lack of Institutional Willingness
  • 46. SLIDE: 46 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Mythbusters  Multi-Model is not secure  Multi-Model is not good for transactions  Multi-Model is hard  Multi-Model does not run in the cloud
  • 47. SLIDE: 47 © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. http://www.oreilly.com/data/free/building- on-multi-model-databases.csp MORE INFO
  • 48. © COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Building on Multi-Model Databases John Biedebach Solution Director, MarkLogic [email protected] 972-841-6060