SlideShare a Scribd company logo
Assessing New
Database Capabilities:
Multi-Model
Presented by: William McKnight
President, McKnight Consulting Group
williammcknight
www.mcknightcg.com
(214) 514-1444
Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2021. All rights reserved.
Rick Jacobs, Technical Marketing Manager
October 10th, 2022
Enterprise Level
Advanced Analytics
Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2022. All rights reserved.
Agenda
Why Couchbase
Couchbase Analytics
Use Cases & Customer Stories
1
2
3
Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2019. All rights reserved.
Why Couchbase
1
Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2020. All rights reserved. 4
How is Couchbase Different?
Mobile/Edge Apps
Applications and Microservices
Fast
• Memory-first design
• Cloud-native scale
• Geo-replication via XDCR
• HA, DR & backup
• Low latency Cloud to Edge
Familiar
• SQL++ query language
• Dynamic Schema
• ACID SQL Transactions
• Cost-based optimizer
• SDKs for 12+ languages
Affordable
• Elastic scaling, sharding &
rebalancing
• Multidimensional scaling
• High-density storage
• Incredible price/performance
Flexible
• JSON document
• Multimodel services
• Cloud deploy anywhere
• Mobile & Edge ready
SQL
Integrated
Cache
JSON
Documents
SQL
Query
Full Text
Search
Operational
Analytics
Eventing
Key-Value
Access
Geo-Replication
& Sync
Mobile
Database
Relational
Capabilities
Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2020. All rights reserved. 5
Database-as-a-Service Self-Managed Cloud
• Maximize convenience
• Easy to start, manage, and scale
• Industry leading price-performance
• Highly available and secure
• Maximize control & customizability
• Leverage DBA’s & OPS team skills
• Choose management strategy & tools
• Deploy via Kubernetes if you choose
Capella Server
Flexible Cloud and Edge Options: Delivering Consistency
“We wanted a solution that seamlessly works across server and mobile, without lots of
retraining. No other solutions came even close to Couchbase.”
Aviram Agmon
Chief Technical Officer
Maccabi
• Offline first design for max uptime
• Extreme speed and reliability
• Data integrity: secure, automated sync
• Broad SQL and device support
Edge & IoT
Mobile
Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2019. All rights reserved.
Couchbase
Analytics
2
Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2020. All rights reserved. 7
Analytics fundamentals
• Fast ingestion
• Near real-time data availability (using DCP)
• No ETL (simple, no paradigm shift)
• Same data model and query language
• MPP processing
• Uses best-of-breed DW algorithms (join,
aggregation, sorting)
• Memory-conscious operators (DGM)
• Workload isolation
• MDS – has its own sub-cluster
• Each query uses all resources
Operations Data
Real-time Analytics
Analytics Tool
Business
Application Ops Data
Node
Analytics
Node
Couchbase Data Platform
Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2020. All rights reserved. 8
Timely
Operational data is
readily available for
analytics when created
and as current
as possible
Flexible
Schema changes on
operational side don’t
impact analyses
Speedy
Analysis queries run
quickly without
impacting operational
performance
Scalable
Scale to
speed up queries
and scale up data
Requirements for an Agile Analytics Platform
Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2021. All rights reserved.
Couchbase Analytics Architecture
Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2022. All rights reserved.
Customer Stories
3
Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2021. All rights reserved. 11
Key Use Cases
Need: Perform data exploration on
operational data in near-real time with
agile data science modeling
Outcome: Enabled new customer
attributes to enable data science
focused consumer segment strategies
→ faster time to insights for consumer
marketing responses from
weeks/months to hours
Need: Perform complex analytical
queries, computations, and aggregations
on JSON data enriched with 3rd party
data without data movement
Outcome: Analytics Service powered
regression calculations to compute 2M+
prices to further improve query
performance by 100% for 200GB+ data.
No need for ETL
eCommerce
Real-time marketing
campaigns
Finance
Investments Modeling
Need: Scale data platform to meet
increased analytics and reporting needs
Outcome: Executives able to answer
key business revenue impact questions
→ “Show detailed effects of COVID-19
on hospitals cancelling elective
procedures to identify underpaid or
unidentified revenue”
Healthcare
Hospital/Clinics Customer
Revenue
Personalized Ordering Risk Scoring BI & Data Scale
eCommerce Food Delivery. Finance. Healthcare
Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2021. All rights reserved. 12
Confidential and Proprietary. Do not distribute without
Couchbase consent. © Couchbase 2020. All rights
reserved.
Outcomes
• Reduction of targeted consumer
offers from of weeks/months →
hours & analyze data in near real-
time
• Enabled agile data mining models
focused on order behaviors, propensity
scoring and enabled flexible attribute
creation
• Removed need to ETL for data
science experiments
Requirements
• Track average transaction size,
annual purchase frequency and
loyalty to determine customer lifetime
value (CLV)
• Deliver personalized marketing
campaigns, segments and reduce
time to perform data science
experiments
• Ability to perform data exploration on
operational data in near-real time
SOLUTION:
Customer Data Management
APPLICATION:
Commerce Data Hub
Data science experimentation
USE CASE(S):
Real time marketing
campaigns and personalized
ordering experience
ABOUT:
World leader in pizza delivery
operating a network of
company-owned and
franchise-owned stores
globally. 3M pizzas a day,
16.5K stores in 85 countries
Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2021. All rights reserved. 13
Confidential and Proprietary. Do not distribute without
Couchbase consent. © Couchbase 2020. All rights
reserved.
Requirements
• Action on near real-time data flow without
transformation
• Enable better fan experience at concession
stands during games and IoT functionality
for ticket scans
• Easy to use SQL-like interface as their
resources are lean and skilled in SQL
Outcomes
• Continuous data sync for real-time
visitor and customer concessionaire
analytics
• Increased customer engagement via
interactive scoreboards, fan kiosks, and
more
• Easy integration with Knowi and Tableau
for real-time executive reporting
SOLUTION:
Customer 360
APPLICATION:
Ticket scan
VIP loyalty program
USE CASE(S):
Real time analytics for
fan interactions
ABOUT:
Professional baseball
franchise valued at
$600M+ with 1.8M+
fan base
Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2021. All rights reserved. 14
Scaling
legacy DB
Mainframe
access
NoSQL
sprawl
Scaling other
NoSQL DB
Managing
multiple DBs
Dedicated DB
per use case
Slow
dev. cycles
Mission-critical
new features
Ever-changing
requirements
Mobile apps
take too long
Modern DB
tech. required
Need to
consolidate tech.
Personalization
+ performance
Fully featured
mobile apps
Single view of
customer
Legacy = more
time, $$, effort
Integrate
disparate data
Delivering Business Outcomes by Solving
Technology Problems
Improving
customer
experience &
engagement
Faster
innovation
& time to
market
Reducing
infrastructure
& operations
costs
Predictable
performance
Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2021. All rights reserved.
Try Couchbase Capella free:
No credit card required
https://www.couchbase.com/products/capella/get-started
THANK YOU
William McKnight
President, McKnight Consulting Group
• Frequent keynote speaker and trainer internationally
• Consulted to Pfizer, Scotiabank, Fidelity, TD
Ameritrade, Teva Pharmaceuticals, Verizon, and many
other Global 1000 companies
• Hundreds of articles, blogs and white papers in
publication
• Focused on delivering business value and solving
business problems utilizing proven, streamlined
approaches to information management
• Former Database Engineer, Fortune 50 Information
Technology executive and Ernst&Young Entrepreneur
of Year Finalist
• Owner/consultant: Research, Data Strategy and
Implementation consulting firm
2
William McKnight
The Savvy Manager’s Guide
The
Savvy
Manager’s
Guide
Information
Management
Information Management
Strategies for Gaining a
Competitive Advantage with Data
McKnight Consulting Group Offerings
Strategy
Training
Strategy
§ Trusted Advisor
§ Action Plans
§ Roadmaps
§ Tool Selections
§ Program Management
Training
§ Classes
§ Workshops
Implementation
§ Data/Data Warehousing/Business
Intelligence/Analytics
§ Big Data
§ Master Data Management
§ Governance/Quality
Implementation
3
McKnight Consulting Group Client Portfolio
Decisions, Decisions, Decisions
• Unprecedented variety of data store choices to meet
the needs of their varied workloads
• Enterprises have many needs for databases, including
cache, operational, data warehouse, master data, ERP,
analytical, graph data, data lake, and time series data
• While vendor offerings have exploded in recent
years, in due time frameworks will integrate
components into what amounts to a single offering
for multiple workloads, perhaps even for the
enterprise
• But what if price-performant offerings for adjacent
workloads in an enterprise have materialized?
5
Many Data Types
• Web Crawlers
• Open Linked Data
• JSON
• XML
• Documents
• Binary
• Graph
• Log Files
6
Why NoSQL for Operational Big Data
More data model flexibility
– Web Services as a data model
– No !schema first" requirement; load first
Faster time to insight from data acquisition
Relaxed ACID
– Eventual consistency
– Willing to trade consistency for availability
– ACID would crush things like storing clicks on Google
Low upfront software and development costs
Programmers love the freedoms
Fault-tolerant redundancy
Linear Scaling to “webscale”
7
• Placement policy:
A copy is written to the node creating the file (write affinity)
A second copy is written to a data node within the same rack (to
minimize cross-rack network traffic)
A third copy is written to a data node in a different rack (to tolerate
switch failures)
Node 5
Node 4
Node 3
Node 2
Node 1
Block
1
Block
3
Block
2
Block
1
Block
3
Block
2
Block
3
Block
2
Block
1
Objectives: load balancing, fast access, fault tolerance
DFS Block Placement
8
CAR
DRIVES
name: “Dan”
born: May 29, 1970
twitter: “@dan”
name: “Ann”
born: Dec 5, 1975
since:
Jan 10, 2011
brand: “Volvo”
model: “V70”
Property Graph Model Components
Nodes
• The objects in the graph
• Can have name-value
properties
• Can be labeled
friends
friends
LIVES WITH
O
W
N
S
PERSON PERSON
Relationships
• Relate nodes by type and
direction
• Can have name-value
properties
9
Semantic Graph
• RDF Triple Store
– Semantic databases only work with RDF
• Target market is users of third-party
data in RDF (all Linked open data)
– Working across data sets
10
Databases are Multi-Model when they can
be either (for example):
11
Data Types and NoSQL Data Models
Data Type Data Model
CSV, TSV or web logs Column, Document
Documents Document
JSON Document
Metadata catalog Column, Document
Keyed images and documents Key-Value
RDF, Linked data Graph
12
Key-Value Stores
What are they?
• NoSQL’s OLTP equivalent
• Extremely simple
• Key-”blob pairs”, that’s it
• Associative array data model
• Retrieve value given a key
– All access is by a key
(key,value)
13
Key-Value Stores
Technical Characteristics:
• Horizontally scalable
• Fast (did I mention fast)
• Resiliency to cluster failures
• Simplicity
• All nodes equal
14
(key,value)
Key-Value Stores
Good for:
• Any single object of unstructured data
• Storing BLOBs
• Fast writes
• Web app cache
• Session Information – get all session information in a
single put/get
• User profile data
• Massive multi-player on-line gaming
• Shopping carts (up until the payment transaction)
• Geo-localized processing
• Speed when you can’t be down
(key,value)
15
A multi-model database is a single, integrated
database that can store, manage and query data i
multiple models such as relational, document,
graph, key-value, column-store, cache. It is the
opposite approach to Polyglot Persistence – the
use of multiple databases in a workload.
16
Document-oriented Databases
What are they?
• Key-Value Stores with added capabilities
– Ability to nest sub-documents
• JSON/XML data models
• With Tree-Like Structure
• Encapsulated document objects
• Groups data together more naturally and
logically
17
Document-oriented Databases
Technical Characteristics:
• Store all data together
– Example: Order document contains all line items
• Documents are self-describing hierarchical tree
structures
• Unlike Key-Value Stores, the value part of the field
can be queried
18
Document-oriented Databases
Good for:
• Semi-structured data
• Web pages
• Web traffic/E-Commerce
• Web analytics
• Log files
• User actions/behaviors
• Content Management Systems
• Full text
• Uncertain data
• Extending object-oriented approaches
• Event logging
• JSON/XML data
19
Document Example
{
"type": "BakingRecipe",
"name": "Mama’s Cornbread",
"ingredients": [
{ "name": "cornmeal", "amount": ”1c" },
{ "name": "flour", "amount": "3/4c" },
{ "name": "baking powder", "amount": "1-1/2t" },
{ "name": "eggs", "amount": "2 large" },
{ "name": ”butter", "amount": "6T" },
{ "name": "buttermilk", "amount": "1-1/2c”,
“brand”: “ABC Brand”}
],
”ovenTemperature": ”425 deg F"
”bakeTime": ”20 min”
}
20
Multiple NoSQL Solutions Working Together
You could use
• Key-Value Store for Shopping Cart and
Session Data
• Document or Column Store for Consuming
Completed Orders
• RDBMS for inventory (small, not served real-
time), financials
• Graph Store for Customer Relationships for
Marketing
21
Column Stores
What are they?
• Data model:
– A big table, with column families
– Map-reduce for querying/processing
• Schema-lite
• No single point of failure
• Operational simplicity
• Closest NoSQL implementation to RDBMS
22
Column Stores
Good for:
• Large amounts of data
• Data that needs compression
• Event logging
• Content Management Systems
• Data model supports semi-structured
data
• Naturally indexed (columns)
• Good at scaling out horizontally
• Time Series data
– Weather data
– Location data
– Sensor data
23
Column Stores Example
24
What to Look for in Multi-Model 1/2
• Excellent implementation of multiple
models
• Single copy of data
• Model change propagation
• Works in microservices world
• Submillisecond response time
25
What to Look for in Multi-Model 2/2
• Globally distributed multi-region
deployments
• Cross-model data processing language
and optimizer
• Edge-capable database
• JSON flattening without data explosion
• Universal indices
26
Emerging Technologies
• Use of artificial
intelligence (AI)
• Integration with data
catalog platforms
• Robust user
experience
• Multi-cloud/native
application
27
Assessing New
Database Capabilities:
Multi-Model
Presented by: William McKnight
President, McKnight Consulting Group
williammcknight
www.mcknightcg.com
(214) 514-1444

More Related Content

Similar to Assessing New Database Capabilities – Multi-Model (20)

PDF
Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...
confluent
 
PDF
Big Data LDN 2018: BIG DATA TOO SLOW? SPRINKLE IN SOME NOSQL
Matt Stubbs
 
PPTX
Revolutionizing the customer experience - Hello Engagement Database
Dipti Borkar
 
PPTX
Couchbase Chennai Meetup 2 - Big Data & Analytics
RedBlackTree
 
PDF
Data-Ed Webinar: Data Modeling Fundamentals
DATAVERSITY
 
PPTX
How companies-use-no sql-and-couchbase-10152013
Dipti Borkar
 
PDF
Enterprise Architecture vs. Data Architecture
DATAVERSITY
 
PDF
Data Architecture Strategies: Data Architecture for Digital Transformation
DATAVERSITY
 
PDF
Slides: Moving from a Relational Model to NoSQL
DATAVERSITY
 
PPTX
Betfair + Couchbase
bloodredsun
 
PDF
Slides: Enterprise Architecture vs. Data Architecture
DATAVERSITY
 
PDF
SVP of Couchbase: The Exciting World of NoSQL: Scaling NoSQL Data, N1QL vs. S...
✔ Eric David Benari, PMP
 
PPTX
Hofstra University - Overview of Big Data
sarasioux
 
PPTX
Why microservices architectures drive exceptional customer experiences
Denis Wilson Souza Rosa
 
PDF
NoSQL - Vital Open Source Ingredient for Modern Success
Arun Gupta
 
PDF
NoSQL, the Vital Open Source Ingredient for Modern Success
All Things Open
 
PPTX
Big data presentationandoverview_of_couchbase
AMAR NATH
 
PDF
Are You Prepared For The Future Of Data Technologies?
Dell World
 
PDF
Impacto del Big Data en la empresa española
Paradigma Digital
 
PPTX
Stream me to the Cloud (and back) with Confluent & MongoDB
confluent
 
Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...
confluent
 
Big Data LDN 2018: BIG DATA TOO SLOW? SPRINKLE IN SOME NOSQL
Matt Stubbs
 
Revolutionizing the customer experience - Hello Engagement Database
Dipti Borkar
 
Couchbase Chennai Meetup 2 - Big Data & Analytics
RedBlackTree
 
Data-Ed Webinar: Data Modeling Fundamentals
DATAVERSITY
 
How companies-use-no sql-and-couchbase-10152013
Dipti Borkar
 
Enterprise Architecture vs. Data Architecture
DATAVERSITY
 
Data Architecture Strategies: Data Architecture for Digital Transformation
DATAVERSITY
 
Slides: Moving from a Relational Model to NoSQL
DATAVERSITY
 
Betfair + Couchbase
bloodredsun
 
Slides: Enterprise Architecture vs. Data Architecture
DATAVERSITY
 
SVP of Couchbase: The Exciting World of NoSQL: Scaling NoSQL Data, N1QL vs. S...
✔ Eric David Benari, PMP
 
Hofstra University - Overview of Big Data
sarasioux
 
Why microservices architectures drive exceptional customer experiences
Denis Wilson Souza Rosa
 
NoSQL - Vital Open Source Ingredient for Modern Success
Arun Gupta
 
NoSQL, the Vital Open Source Ingredient for Modern Success
All Things Open
 
Big data presentationandoverview_of_couchbase
AMAR NATH
 
Are You Prepared For The Future Of Data Technologies?
Dell World
 
Impacto del Big Data en la empresa española
Paradigma Digital
 
Stream me to the Cloud (and back) with Confluent & MongoDB
confluent
 

More from DATAVERSITY (20)

PDF
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
DATAVERSITY
 
PDF
Data at the Speed of Business with Data Mastering and Governance
DATAVERSITY
 
PDF
Exploring Levels of Data Literacy
DATAVERSITY
 
PDF
Building a Data Strategy – Practical Steps for Aligning with Business Goals
DATAVERSITY
 
PDF
Make Data Work for You
DATAVERSITY
 
PDF
Data Catalogs Are the Answer – What is the Question?
DATAVERSITY
 
PDF
Data Catalogs Are the Answer – What Is the Question?
DATAVERSITY
 
PDF
Data Modeling Fundamentals
DATAVERSITY
 
PDF
Showing ROI for Your Analytic Project
DATAVERSITY
 
PDF
How a Semantic Layer Makes Data Mesh Work at Scale
DATAVERSITY
 
PDF
Is Enterprise Data Literacy Possible?
DATAVERSITY
 
PDF
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
DATAVERSITY
 
PDF
Emerging Trends in Data Architecture – What’s the Next Big Thing?
DATAVERSITY
 
PDF
Data Governance Trends - A Look Backwards and Forwards
DATAVERSITY
 
PDF
Data Governance Trends and Best Practices To Implement Today
DATAVERSITY
 
PDF
2023 Trends in Enterprise Analytics
DATAVERSITY
 
PDF
Data Strategy Best Practices
DATAVERSITY
 
PDF
Who Should Own Data Governance – IT or Business?
DATAVERSITY
 
PDF
Data Management Best Practices
DATAVERSITY
 
PDF
MLOps – Applying DevOps to Competitive Advantage
DATAVERSITY
 
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
DATAVERSITY
 
Data at the Speed of Business with Data Mastering and Governance
DATAVERSITY
 
Exploring Levels of Data Literacy
DATAVERSITY
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
DATAVERSITY
 
Make Data Work for You
DATAVERSITY
 
Data Catalogs Are the Answer – What is the Question?
DATAVERSITY
 
Data Catalogs Are the Answer – What Is the Question?
DATAVERSITY
 
Data Modeling Fundamentals
DATAVERSITY
 
Showing ROI for Your Analytic Project
DATAVERSITY
 
How a Semantic Layer Makes Data Mesh Work at Scale
DATAVERSITY
 
Is Enterprise Data Literacy Possible?
DATAVERSITY
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
DATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
DATAVERSITY
 
Data Governance Trends - A Look Backwards and Forwards
DATAVERSITY
 
Data Governance Trends and Best Practices To Implement Today
DATAVERSITY
 
2023 Trends in Enterprise Analytics
DATAVERSITY
 
Data Strategy Best Practices
DATAVERSITY
 
Who Should Own Data Governance – IT or Business?
DATAVERSITY
 
Data Management Best Practices
DATAVERSITY
 
MLOps – Applying DevOps to Competitive Advantage
DATAVERSITY
 

Recently uploaded (20)

PPTX
ER_Model_with_Diagrams_Presentation.pptx
dharaadhvaryu1992
 
PPTX
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
PDF
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
apidays
 
PPTX
apidays Singapore 2025 - From Data to Insights: Building AI-Powered Data APIs...
apidays
 
PDF
Context Engineering for AI Agents, approaches, memories.pdf
Tamanna
 
PPTX
apidays Helsinki & North 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (A...
apidays
 
PPTX
AI Presentation Tool Pitch Deck Presentation.pptx
ShyamPanthavoor1
 
PDF
Avatar for apidays apidays PRO June 07, 2025 0 5 apidays Helsinki & North 2...
apidays
 
PDF
apidays Helsinki & North 2025 - Monetizing AI APIs: The New API Economy, Alla...
apidays
 
PDF
OOPs with Java_unit2.pdf. sarthak bookkk
Sarthak964187
 
PPTX
apidays Helsinki & North 2025 - APIs at Scale: Designing for Alignment, Trust...
apidays
 
PPTX
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 
PDF
Copia de Strategic Roadmap Infographics by Slidesgo.pptx (1).pdf
ssuserd4c6911
 
PDF
Building Production-Ready AI Agents with LangGraph.pdf
Tamanna
 
PDF
The European Business Wallet: Why It Matters and How It Powers the EUDI Ecosy...
Lal Chandran
 
PPTX
Climate Action.pptx action plan for climate
justfortalabat
 
PPTX
apidays Singapore 2025 - Designing for Change, Julie Schiller (Google)
apidays
 
PPTX
b6057ea5-8e8c-4415-90c0-ed8e9666ffcd.pptx
Anees487379
 
PDF
Web Scraping with Google Gemini 2.0 .pdf
Tamanna
 
PDF
apidays Helsinki & North 2025 - How (not) to run a Graphql Stewardship Group,...
apidays
 
ER_Model_with_Diagrams_Presentation.pptx
dharaadhvaryu1992
 
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
apidays
 
apidays Singapore 2025 - From Data to Insights: Building AI-Powered Data APIs...
apidays
 
Context Engineering for AI Agents, approaches, memories.pdf
Tamanna
 
apidays Helsinki & North 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (A...
apidays
 
AI Presentation Tool Pitch Deck Presentation.pptx
ShyamPanthavoor1
 
Avatar for apidays apidays PRO June 07, 2025 0 5 apidays Helsinki & North 2...
apidays
 
apidays Helsinki & North 2025 - Monetizing AI APIs: The New API Economy, Alla...
apidays
 
OOPs with Java_unit2.pdf. sarthak bookkk
Sarthak964187
 
apidays Helsinki & North 2025 - APIs at Scale: Designing for Alignment, Trust...
apidays
 
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 
Copia de Strategic Roadmap Infographics by Slidesgo.pptx (1).pdf
ssuserd4c6911
 
Building Production-Ready AI Agents with LangGraph.pdf
Tamanna
 
The European Business Wallet: Why It Matters and How It Powers the EUDI Ecosy...
Lal Chandran
 
Climate Action.pptx action plan for climate
justfortalabat
 
apidays Singapore 2025 - Designing for Change, Julie Schiller (Google)
apidays
 
b6057ea5-8e8c-4415-90c0-ed8e9666ffcd.pptx
Anees487379
 
Web Scraping with Google Gemini 2.0 .pdf
Tamanna
 
apidays Helsinki & North 2025 - How (not) to run a Graphql Stewardship Group,...
apidays
 

Assessing New Database Capabilities – Multi-Model

  • 1. Assessing New Database Capabilities: Multi-Model Presented by: William McKnight President, McKnight Consulting Group williammcknight www.mcknightcg.com (214) 514-1444
  • 2. Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2021. All rights reserved. Rick Jacobs, Technical Marketing Manager October 10th, 2022 Enterprise Level Advanced Analytics
  • 3. Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2022. All rights reserved. Agenda Why Couchbase Couchbase Analytics Use Cases & Customer Stories 1 2 3
  • 4. Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2019. All rights reserved. Why Couchbase 1
  • 5. Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2020. All rights reserved. 4 How is Couchbase Different? Mobile/Edge Apps Applications and Microservices Fast • Memory-first design • Cloud-native scale • Geo-replication via XDCR • HA, DR & backup • Low latency Cloud to Edge Familiar • SQL++ query language • Dynamic Schema • ACID SQL Transactions • Cost-based optimizer • SDKs for 12+ languages Affordable • Elastic scaling, sharding & rebalancing • Multidimensional scaling • High-density storage • Incredible price/performance Flexible • JSON document • Multimodel services • Cloud deploy anywhere • Mobile & Edge ready SQL Integrated Cache JSON Documents SQL Query Full Text Search Operational Analytics Eventing Key-Value Access Geo-Replication & Sync Mobile Database Relational Capabilities
  • 6. Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2020. All rights reserved. 5 Database-as-a-Service Self-Managed Cloud • Maximize convenience • Easy to start, manage, and scale • Industry leading price-performance • Highly available and secure • Maximize control & customizability • Leverage DBA’s & OPS team skills • Choose management strategy & tools • Deploy via Kubernetes if you choose Capella Server Flexible Cloud and Edge Options: Delivering Consistency “We wanted a solution that seamlessly works across server and mobile, without lots of retraining. No other solutions came even close to Couchbase.” Aviram Agmon Chief Technical Officer Maccabi • Offline first design for max uptime • Extreme speed and reliability • Data integrity: secure, automated sync • Broad SQL and device support Edge & IoT Mobile
  • 7. Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2019. All rights reserved. Couchbase Analytics 2
  • 8. Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2020. All rights reserved. 7 Analytics fundamentals • Fast ingestion • Near real-time data availability (using DCP) • No ETL (simple, no paradigm shift) • Same data model and query language • MPP processing • Uses best-of-breed DW algorithms (join, aggregation, sorting) • Memory-conscious operators (DGM) • Workload isolation • MDS – has its own sub-cluster • Each query uses all resources Operations Data Real-time Analytics Analytics Tool Business Application Ops Data Node Analytics Node Couchbase Data Platform
  • 9. Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2020. All rights reserved. 8 Timely Operational data is readily available for analytics when created and as current as possible Flexible Schema changes on operational side don’t impact analyses Speedy Analysis queries run quickly without impacting operational performance Scalable Scale to speed up queries and scale up data Requirements for an Agile Analytics Platform
  • 10. Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2021. All rights reserved. Couchbase Analytics Architecture
  • 11. Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2022. All rights reserved. Customer Stories 3
  • 12. Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2021. All rights reserved. 11 Key Use Cases Need: Perform data exploration on operational data in near-real time with agile data science modeling Outcome: Enabled new customer attributes to enable data science focused consumer segment strategies → faster time to insights for consumer marketing responses from weeks/months to hours Need: Perform complex analytical queries, computations, and aggregations on JSON data enriched with 3rd party data without data movement Outcome: Analytics Service powered regression calculations to compute 2M+ prices to further improve query performance by 100% for 200GB+ data. No need for ETL eCommerce Real-time marketing campaigns Finance Investments Modeling Need: Scale data platform to meet increased analytics and reporting needs Outcome: Executives able to answer key business revenue impact questions → “Show detailed effects of COVID-19 on hospitals cancelling elective procedures to identify underpaid or unidentified revenue” Healthcare Hospital/Clinics Customer Revenue Personalized Ordering Risk Scoring BI & Data Scale eCommerce Food Delivery. Finance. Healthcare
  • 13. Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2021. All rights reserved. 12 Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2020. All rights reserved. Outcomes • Reduction of targeted consumer offers from of weeks/months → hours & analyze data in near real- time • Enabled agile data mining models focused on order behaviors, propensity scoring and enabled flexible attribute creation • Removed need to ETL for data science experiments Requirements • Track average transaction size, annual purchase frequency and loyalty to determine customer lifetime value (CLV) • Deliver personalized marketing campaigns, segments and reduce time to perform data science experiments • Ability to perform data exploration on operational data in near-real time SOLUTION: Customer Data Management APPLICATION: Commerce Data Hub Data science experimentation USE CASE(S): Real time marketing campaigns and personalized ordering experience ABOUT: World leader in pizza delivery operating a network of company-owned and franchise-owned stores globally. 3M pizzas a day, 16.5K stores in 85 countries
  • 14. Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2021. All rights reserved. 13 Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2020. All rights reserved. Requirements • Action on near real-time data flow without transformation • Enable better fan experience at concession stands during games and IoT functionality for ticket scans • Easy to use SQL-like interface as their resources are lean and skilled in SQL Outcomes • Continuous data sync for real-time visitor and customer concessionaire analytics • Increased customer engagement via interactive scoreboards, fan kiosks, and more • Easy integration with Knowi and Tableau for real-time executive reporting SOLUTION: Customer 360 APPLICATION: Ticket scan VIP loyalty program USE CASE(S): Real time analytics for fan interactions ABOUT: Professional baseball franchise valued at $600M+ with 1.8M+ fan base
  • 15. Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2021. All rights reserved. 14 Scaling legacy DB Mainframe access NoSQL sprawl Scaling other NoSQL DB Managing multiple DBs Dedicated DB per use case Slow dev. cycles Mission-critical new features Ever-changing requirements Mobile apps take too long Modern DB tech. required Need to consolidate tech. Personalization + performance Fully featured mobile apps Single view of customer Legacy = more time, $$, effort Integrate disparate data Delivering Business Outcomes by Solving Technology Problems Improving customer experience & engagement Faster innovation & time to market Reducing infrastructure & operations costs Predictable performance
  • 16. Confidential and Proprietary. Do not distribute without Couchbase consent. © Couchbase 2021. All rights reserved. Try Couchbase Capella free: No credit card required https://www.couchbase.com/products/capella/get-started THANK YOU
  • 17. William McKnight President, McKnight Consulting Group • Frequent keynote speaker and trainer internationally • Consulted to Pfizer, Scotiabank, Fidelity, TD Ameritrade, Teva Pharmaceuticals, Verizon, and many other Global 1000 companies • Hundreds of articles, blogs and white papers in publication • Focused on delivering business value and solving business problems utilizing proven, streamlined approaches to information management • Former Database Engineer, Fortune 50 Information Technology executive and Ernst&Young Entrepreneur of Year Finalist • Owner/consultant: Research, Data Strategy and Implementation consulting firm 2 William McKnight The Savvy Manager’s Guide The Savvy Manager’s Guide Information Management Information Management Strategies for Gaining a Competitive Advantage with Data
  • 18. McKnight Consulting Group Offerings Strategy Training Strategy § Trusted Advisor § Action Plans § Roadmaps § Tool Selections § Program Management Training § Classes § Workshops Implementation § Data/Data Warehousing/Business Intelligence/Analytics § Big Data § Master Data Management § Governance/Quality Implementation 3
  • 19. McKnight Consulting Group Client Portfolio
  • 20. Decisions, Decisions, Decisions • Unprecedented variety of data store choices to meet the needs of their varied workloads • Enterprises have many needs for databases, including cache, operational, data warehouse, master data, ERP, analytical, graph data, data lake, and time series data • While vendor offerings have exploded in recent years, in due time frameworks will integrate components into what amounts to a single offering for multiple workloads, perhaps even for the enterprise • But what if price-performant offerings for adjacent workloads in an enterprise have materialized? 5
  • 21. Many Data Types • Web Crawlers • Open Linked Data • JSON • XML • Documents • Binary • Graph • Log Files 6
  • 22. Why NoSQL for Operational Big Data More data model flexibility – Web Services as a data model – No !schema first" requirement; load first Faster time to insight from data acquisition Relaxed ACID – Eventual consistency – Willing to trade consistency for availability – ACID would crush things like storing clicks on Google Low upfront software and development costs Programmers love the freedoms Fault-tolerant redundancy Linear Scaling to “webscale” 7
  • 23. • Placement policy: A copy is written to the node creating the file (write affinity) A second copy is written to a data node within the same rack (to minimize cross-rack network traffic) A third copy is written to a data node in a different rack (to tolerate switch failures) Node 5 Node 4 Node 3 Node 2 Node 1 Block 1 Block 3 Block 2 Block 1 Block 3 Block 2 Block 3 Block 2 Block 1 Objectives: load balancing, fast access, fault tolerance DFS Block Placement 8
  • 24. CAR DRIVES name: “Dan” born: May 29, 1970 twitter: “@dan” name: “Ann” born: Dec 5, 1975 since: Jan 10, 2011 brand: “Volvo” model: “V70” Property Graph Model Components Nodes • The objects in the graph • Can have name-value properties • Can be labeled friends friends LIVES WITH O W N S PERSON PERSON Relationships • Relate nodes by type and direction • Can have name-value properties 9
  • 25. Semantic Graph • RDF Triple Store – Semantic databases only work with RDF • Target market is users of third-party data in RDF (all Linked open data) – Working across data sets 10
  • 26. Databases are Multi-Model when they can be either (for example): 11
  • 27. Data Types and NoSQL Data Models Data Type Data Model CSV, TSV or web logs Column, Document Documents Document JSON Document Metadata catalog Column, Document Keyed images and documents Key-Value RDF, Linked data Graph 12
  • 28. Key-Value Stores What are they? • NoSQL’s OLTP equivalent • Extremely simple • Key-”blob pairs”, that’s it • Associative array data model • Retrieve value given a key – All access is by a key (key,value) 13
  • 29. Key-Value Stores Technical Characteristics: • Horizontally scalable • Fast (did I mention fast) • Resiliency to cluster failures • Simplicity • All nodes equal 14 (key,value)
  • 30. Key-Value Stores Good for: • Any single object of unstructured data • Storing BLOBs • Fast writes • Web app cache • Session Information – get all session information in a single put/get • User profile data • Massive multi-player on-line gaming • Shopping carts (up until the payment transaction) • Geo-localized processing • Speed when you can’t be down (key,value) 15
  • 31. A multi-model database is a single, integrated database that can store, manage and query data i multiple models such as relational, document, graph, key-value, column-store, cache. It is the opposite approach to Polyglot Persistence – the use of multiple databases in a workload. 16
  • 32. Document-oriented Databases What are they? • Key-Value Stores with added capabilities – Ability to nest sub-documents • JSON/XML data models • With Tree-Like Structure • Encapsulated document objects • Groups data together more naturally and logically 17
  • 33. Document-oriented Databases Technical Characteristics: • Store all data together – Example: Order document contains all line items • Documents are self-describing hierarchical tree structures • Unlike Key-Value Stores, the value part of the field can be queried 18
  • 34. Document-oriented Databases Good for: • Semi-structured data • Web pages • Web traffic/E-Commerce • Web analytics • Log files • User actions/behaviors • Content Management Systems • Full text • Uncertain data • Extending object-oriented approaches • Event logging • JSON/XML data 19
  • 35. Document Example { "type": "BakingRecipe", "name": "Mama’s Cornbread", "ingredients": [ { "name": "cornmeal", "amount": ”1c" }, { "name": "flour", "amount": "3/4c" }, { "name": "baking powder", "amount": "1-1/2t" }, { "name": "eggs", "amount": "2 large" }, { "name": ”butter", "amount": "6T" }, { "name": "buttermilk", "amount": "1-1/2c”, “brand”: “ABC Brand”} ], ”ovenTemperature": ”425 deg F" ”bakeTime": ”20 min” } 20
  • 36. Multiple NoSQL Solutions Working Together You could use • Key-Value Store for Shopping Cart and Session Data • Document or Column Store for Consuming Completed Orders • RDBMS for inventory (small, not served real- time), financials • Graph Store for Customer Relationships for Marketing 21
  • 37. Column Stores What are they? • Data model: – A big table, with column families – Map-reduce for querying/processing • Schema-lite • No single point of failure • Operational simplicity • Closest NoSQL implementation to RDBMS 22
  • 38. Column Stores Good for: • Large amounts of data • Data that needs compression • Event logging • Content Management Systems • Data model supports semi-structured data • Naturally indexed (columns) • Good at scaling out horizontally • Time Series data – Weather data – Location data – Sensor data 23
  • 40. What to Look for in Multi-Model 1/2 • Excellent implementation of multiple models • Single copy of data • Model change propagation • Works in microservices world • Submillisecond response time 25
  • 41. What to Look for in Multi-Model 2/2 • Globally distributed multi-region deployments • Cross-model data processing language and optimizer • Edge-capable database • JSON flattening without data explosion • Universal indices 26
  • 42. Emerging Technologies • Use of artificial intelligence (AI) • Integration with data catalog platforms • Robust user experience • Multi-cloud/native application 27
  • 43. Assessing New Database Capabilities: Multi-Model Presented by: William McKnight President, McKnight Consulting Group williammcknight www.mcknightcg.com (214) 514-1444