SlideShare a Scribd company logo
data modeling for apache
cassandra
with a sprinkle of C* background and some time series fun
dani traphagen
@dtrapezoid
quiz time - who is this?
but
how
do I
even
?
query driven methodology
conceptual data
model
access patterns
mapping
rules and
patterns
logical data
model
erd queries
diagram it
conceptual data modeling
• abstract view of the domain
• technology independent
• not specific to any database system
entity relationship diagram
• entity types
• relationship types
• attribute types
relationship keys
attribute key participation
• relationship attributes can make part
of the key
full conceptual model
query driven methodology
conceptual data
model
access patterns
mapping
rules and
patterns
logical data
model
erd queries
diagram it
application workflow
• each application has a workflow
• access patterns help us get how data
is accessed
• know your queries you will run before
you design your model
application workflow
query driven methodology
conceptual data
model
access patterns
mapping
rules and
patterns
logical data
model
erd queries
diagram it
mapping rules & patterns
• mapping rules ensure that a logical
data model is right
• each query has a corresponding table
• tables are designed to allow queries to
execute properly
• tables return data in the correct order
mapping rules
1. entities and
relationships
2. equality search
attributes
3. inequality search
attributes
4. ordering attributes
5. key attributes
entities and relationships
1. entity and relationship types map to
tables
2. entities and relationships map to
partitions or rows
3. partition may have data about one or
more entities and relationships
4. attributes are represented by
columns
entities and relationships
Each	relationship	becomes	a	row	in	the	table
Relationship	type	attributes	are	represented	by	columns
Queries	and	relationship	cardinality	affects	the	design	of	the	primary	key
• each relationship becomes a row in the table
• relationship type attributes are represented by columns
• queries and relationship cardinality affects the design
of the primary key
equality search attributes
• equality search attributes become
initial columns of a primary key
• querying on: title and type
inequality search attributes
• clustering columns follow partition
key columns in a primary key
• the column involved in an inequality
search must come after columns in
the primary key that are used in an
equality search
ordering attributes
1. ordering attributes become clustering columns
2. querying on: user_id = ? and
uploaded_timestamp > ?
3. ordering attributes : uploaded_timestamp (DESC)
key attributes
1. relationship type key attributes are included as
primary key columns
2. queries affect primary key design
query driven methodology
conceptual data
model
access patterns
mapping
rules and
patterns
logical data
model
erd queries
diagram it
logical data model
this
sucks
a real world example, w/time series data
This	is	using	simple	strategy,	but	don’t	do	that	in	prod	PLEASE!
Data Modeling for Apache Cassandra
Data Modeling for Apache Cassandra
Data Modeling for Apache Cassandra
Data Modeling for Apache Cassandra
Data Modeling for Apache Cassandra
time series bucketing
Data Modeling for Apache Cassandra
Data Modeling for Apache Cassandra
Data Modeling for Apache Cassandra
Data Modeling for Apache Cassandra
well dani this is all great and
everything…but just how do i get
started? hm? i mean i want to learn
cassandra and try it and everything
but it is just so daunting. i mean
you showed me all this stuff and i
am overwhelmed and it is sort of
making me insecure. what exactly
do i do next to learn more? how do i
really dig in and learn these
concepts? have you thought about
that? have you thought about what i
NEED?
HMMMM DANI, HAVE
YOU??!!!!!!!!!!11
Data Modeling for Apache Cassandra
& prepare to be wowed
cdm
https://github.com/riptano/cdm
cdm preview
namaste
but what does
it all mean?
also, we’re
hiring

More Related Content

What's hot (20)

PDF
Introduction to Data Governance
John Bao Vuu
 
PPT
Data Models [DATABASE SYSTEMS: Design, Implementation, and Management]
Usman Tariq
 
PPT
Normalization
meet darji
 
ODP
Partitioning
Reema Gajjar
 
PPT
SQL Queries
Nilt1234
 
PPTX
Basic sql Commands
MUHAMMED MASHAHIL PUKKUNNUMMAL
 
PPTX
Chapter 1 introduction to sql server
baabtra.com - No. 1 supplier of quality freshers
 
PDF
DAMA Feb2015 Mastering Master Data
Mary Levins, PMP
 
PDF
Data Warehouse Interview Questions And Answers | Data Warehouse Tutorial | Ed...
Edureka!
 
PPTX
Aggregate functions in SQL.pptx
SherinRappai
 
PPTX
Module 1 Data Governance and Stewardship Core Concepts1.pptx
Ahmad Rjoub
 
PPTX
‏‏Chapter 8: Reference and Master Data Management
Ahmed Alorage
 
PDF
Adaptive Query Execution: Speeding Up Spark SQL at Runtime
Databricks
 
ODP
Cassandra Data Modelling
Knoldus Inc.
 
PPTX
Apache Kafka and the Data Mesh | Ben Stopford and Michael Noll, Confluent
HostedbyConfluent
 
PDF
Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...
DATAVERSITY
 
PPTX
Data Quality Dashboards
William Sharp
 
PDF
Why Data Virtualization? An Introduction
Denodo
 
PDF
DataMinds 2022 Azure Purview Erwin de Kreuk
Erwin de Kreuk
 
PPT
Informatica Cloud Overview
Darren Cunningham
 
Introduction to Data Governance
John Bao Vuu
 
Data Models [DATABASE SYSTEMS: Design, Implementation, and Management]
Usman Tariq
 
Normalization
meet darji
 
Partitioning
Reema Gajjar
 
SQL Queries
Nilt1234
 
Basic sql Commands
MUHAMMED MASHAHIL PUKKUNNUMMAL
 
Chapter 1 introduction to sql server
baabtra.com - No. 1 supplier of quality freshers
 
DAMA Feb2015 Mastering Master Data
Mary Levins, PMP
 
Data Warehouse Interview Questions And Answers | Data Warehouse Tutorial | Ed...
Edureka!
 
Aggregate functions in SQL.pptx
SherinRappai
 
Module 1 Data Governance and Stewardship Core Concepts1.pptx
Ahmad Rjoub
 
‏‏Chapter 8: Reference and Master Data Management
Ahmed Alorage
 
Adaptive Query Execution: Speeding Up Spark SQL at Runtime
Databricks
 
Cassandra Data Modelling
Knoldus Inc.
 
Apache Kafka and the Data Mesh | Ben Stopford and Michael Noll, Confluent
HostedbyConfluent
 
Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...
DATAVERSITY
 
Data Quality Dashboards
William Sharp
 
Why Data Virtualization? An Introduction
Denodo
 
DataMinds 2022 Azure Purview Erwin de Kreuk
Erwin de Kreuk
 
Informatica Cloud Overview
Darren Cunningham
 

Similar to Data Modeling for Apache Cassandra (20)

PDF
data-modeling-paper
Artem Chebotko
 
PPTX
Query driven development
João Paulo Leonidas Fernandes Dias da Silva
 
PDF
Using the Chebotko Method to Design Sound and Scalable Data Models for Apache...
Artem Chebotko
 
PDF
Cassandra
Lucian Neghina
 
PDF
Cassandra Data Modelling with CQL (OSCON 2015)
twentyideas
 
PDF
Cassandra Day Atlanta 2015: Data Modeling 101
DataStax Academy
 
PDF
Cassandra Day Chicago 2015: Apache Cassandra Data Modeling 101
DataStax Academy
 
PDF
Cassandra Day London 2015: Data Modeling 101
DataStax Academy
 
PDF
Introduction to data modeling with apache cassandra
Patrick McFadin
 
PPTX
Introduction to cassandra
Tarun Garg
 
PDF
The data model is dead, long live the data model
Patrick McFadin
 
PDF
Rigorous Cassandra Data Modeling for the Relational Data Architect
Artem Chebotko
 
PDF
DataStax: Rigorous Cassandra Data Modeling for the Relational Data Architect
DataStax Academy
 
PDF
Apache Cassandra & Data Modeling
Massimiliano Tomassi
 
PPTX
Apache Cassandra Developer Training Slide Deck
DataStax Academy
 
PPTX
Cassandra20141009
Brian Enochson
 
PDF
Introduction to Data Modeling with Apache Cassandra
Luke Tillman
 
DOCX
Cassandra data modelling best practices
Sandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW
 
DOCX
llr+ cHApTEFt s Database Processing(2) Does this design e.docx
smile790243
 
PPTX
Cassandra20141113
Brian Enochson
 
data-modeling-paper
Artem Chebotko
 
Using the Chebotko Method to Design Sound and Scalable Data Models for Apache...
Artem Chebotko
 
Cassandra
Lucian Neghina
 
Cassandra Data Modelling with CQL (OSCON 2015)
twentyideas
 
Cassandra Day Atlanta 2015: Data Modeling 101
DataStax Academy
 
Cassandra Day Chicago 2015: Apache Cassandra Data Modeling 101
DataStax Academy
 
Cassandra Day London 2015: Data Modeling 101
DataStax Academy
 
Introduction to data modeling with apache cassandra
Patrick McFadin
 
Introduction to cassandra
Tarun Garg
 
The data model is dead, long live the data model
Patrick McFadin
 
Rigorous Cassandra Data Modeling for the Relational Data Architect
Artem Chebotko
 
DataStax: Rigorous Cassandra Data Modeling for the Relational Data Architect
DataStax Academy
 
Apache Cassandra & Data Modeling
Massimiliano Tomassi
 
Apache Cassandra Developer Training Slide Deck
DataStax Academy
 
Cassandra20141009
Brian Enochson
 
Introduction to Data Modeling with Apache Cassandra
Luke Tillman
 
Cassandra data modelling best practices
Sandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW
 
llr+ cHApTEFt s Database Processing(2) Does this design e.docx
smile790243
 
Cassandra20141113
Brian Enochson
 
Ad

More from DataStax Academy (20)

PDF
Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
DataStax Academy
 
PPTX
Introduction to DataStax Enterprise Graph Database
DataStax Academy
 
PPTX
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
DataStax Academy
 
PPTX
Cassandra on Docker @ Walmart Labs
DataStax Academy
 
PDF
Cassandra 3.0 Data Modeling
DataStax Academy
 
PPTX
Cassandra Adoption on Cisco UCS & Open stack
DataStax Academy
 
PDF
Coursera Cassandra Driver
DataStax Academy
 
PDF
Production Ready Cassandra
DataStax Academy
 
PDF
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
DataStax Academy
 
PPTX
Cassandra @ Sony: The good, the bad, and the ugly part 1
DataStax Academy
 
PPTX
Cassandra @ Sony: The good, the bad, and the ugly part 2
DataStax Academy
 
PDF
Standing Up Your First Cluster
DataStax Academy
 
PDF
Real Time Analytics with Dse
DataStax Academy
 
PDF
Introduction to Data Modeling with Apache Cassandra
DataStax Academy
 
PDF
Cassandra Core Concepts
DataStax Academy
 
PPTX
Enabling Search in your Cassandra Application with DataStax Enterprise
DataStax Academy
 
PPTX
Bad Habits Die Hard
DataStax Academy
 
PDF
Advanced Data Modeling with Apache Cassandra
DataStax Academy
 
PDF
Advanced Cassandra
DataStax Academy
 
PDF
Apache Cassandra and Drivers
DataStax Academy
 
Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
DataStax Academy
 
Introduction to DataStax Enterprise Graph Database
DataStax Academy
 
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
DataStax Academy
 
Cassandra on Docker @ Walmart Labs
DataStax Academy
 
Cassandra 3.0 Data Modeling
DataStax Academy
 
Cassandra Adoption on Cisco UCS & Open stack
DataStax Academy
 
Coursera Cassandra Driver
DataStax Academy
 
Production Ready Cassandra
DataStax Academy
 
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
DataStax Academy
 
Cassandra @ Sony: The good, the bad, and the ugly part 1
DataStax Academy
 
Cassandra @ Sony: The good, the bad, and the ugly part 2
DataStax Academy
 
Standing Up Your First Cluster
DataStax Academy
 
Real Time Analytics with Dse
DataStax Academy
 
Introduction to Data Modeling with Apache Cassandra
DataStax Academy
 
Cassandra Core Concepts
DataStax Academy
 
Enabling Search in your Cassandra Application with DataStax Enterprise
DataStax Academy
 
Bad Habits Die Hard
DataStax Academy
 
Advanced Data Modeling with Apache Cassandra
DataStax Academy
 
Advanced Cassandra
DataStax Academy
 
Apache Cassandra and Drivers
DataStax Academy
 
Ad

Recently uploaded (20)

PDF
Bharatiya Antariksh Hackathon 2025 Idea Submission PPT.pdf
ghjghvhjgc
 
PPTX
Essential Content-centric Plugins for your Website
Laura Byrne
 
PDF
Automating Feature Enrichment and Station Creation in Natural Gas Utility Net...
Safe Software
 
PDF
Software Development Company Keene Systems, Inc (1).pdf
Custom Software Development Company | Keene Systems, Inc.
 
PDF
“NPU IP Hardware Shaped Through Software and Use-case Analysis,” a Presentati...
Edge AI and Vision Alliance
 
PDF
99 Bottles of Trust on the Wall — Operational Principles for Trust in Cyber C...
treyka
 
PDF
Modern Decentralized Application Architectures.pdf
Kalema Edgar
 
PDF
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
PPTX
Mastering ODC + Okta Configuration - Chennai OSUG
HathiMaryA
 
PDF
“ONNX and Python to C++: State-of-the-art Graph Compilation,” a Presentation ...
Edge AI and Vision Alliance
 
PDF
NLJUG Speaker academy 2025 - first session
Bert Jan Schrijver
 
PDF
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
PPTX
Wondershare Filmora Crack Free Download 2025
josanj305
 
PDF
NASA A Researcher’s Guide to International Space Station : Fundamental Physics
Dr. PANKAJ DHUSSA
 
PDF
Next Generation AI: Anticipatory Intelligence, Forecasting Inflection Points ...
dleka294658677
 
PDF
Dev Dives: Accelerating agentic automation with Autopilot for Everyone
UiPathCommunity
 
PPTX
CapCut Pro PC Crack Latest Version Free Free
josanj305
 
PPTX
MuleSoft MCP Support (Model Context Protocol) and Use Case Demo
shyamraj55
 
PPTX
Securing Model Context Protocol with Keycloak: AuthN/AuthZ for MCP Servers
Hitachi, Ltd. OSS Solution Center.
 
PDF
Transcript: Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
Bharatiya Antariksh Hackathon 2025 Idea Submission PPT.pdf
ghjghvhjgc
 
Essential Content-centric Plugins for your Website
Laura Byrne
 
Automating Feature Enrichment and Station Creation in Natural Gas Utility Net...
Safe Software
 
Software Development Company Keene Systems, Inc (1).pdf
Custom Software Development Company | Keene Systems, Inc.
 
“NPU IP Hardware Shaped Through Software and Use-case Analysis,” a Presentati...
Edge AI and Vision Alliance
 
99 Bottles of Trust on the Wall — Operational Principles for Trust in Cyber C...
treyka
 
Modern Decentralized Application Architectures.pdf
Kalema Edgar
 
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
Mastering ODC + Okta Configuration - Chennai OSUG
HathiMaryA
 
“ONNX and Python to C++: State-of-the-art Graph Compilation,” a Presentation ...
Edge AI and Vision Alliance
 
NLJUG Speaker academy 2025 - first session
Bert Jan Schrijver
 
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
Wondershare Filmora Crack Free Download 2025
josanj305
 
NASA A Researcher’s Guide to International Space Station : Fundamental Physics
Dr. PANKAJ DHUSSA
 
Next Generation AI: Anticipatory Intelligence, Forecasting Inflection Points ...
dleka294658677
 
Dev Dives: Accelerating agentic automation with Autopilot for Everyone
UiPathCommunity
 
CapCut Pro PC Crack Latest Version Free Free
josanj305
 
MuleSoft MCP Support (Model Context Protocol) and Use Case Demo
shyamraj55
 
Securing Model Context Protocol with Keycloak: AuthN/AuthZ for MCP Servers
Hitachi, Ltd. OSS Solution Center.
 
Transcript: Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 

Data Modeling for Apache Cassandra