DataStax Advanced Replication
Brian Hess
Sr. Product Manager, Analytics
Why Advanced Replication
• Standard Cassandra replication has its limits
• Lots of disconnected “edge” nodes/data centers/clusters
• Replicating to central “mother ship” for aggregating
• Inconsistent connectivity
• All data centers are read-write – no read-only DCs
© 2016 DataStax, All Rights Reserved.
What is Advanced Replication
• Advanced Replication supports:
  • Many edge clusters replicating to a central hub
  • Consistent or sporadic connectivity – “store and forward”
  • Prioritized streams for limited bandwidth situations
  • One-way replication
  • Active queries at the edge, as well as replicating to the hub
  • Search/Analytics supported at edge and hub clusters
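As a rough illustration of the “store and forward” and one-way behavior listed above, the following Python sketch models an edge that keeps accepting local writes while disconnected and forwards its backlog once the link returns. The class names, the `connected` flag, and the in-memory queue are assumptions made for this illustration; they are not DataStax APIs.

```python
from collections import deque


class Hub:
    """Central cluster; in this sketch it only receives replicated rows."""

    def __init__(self):
        self.rows = []

    def receive(self, edge_name, row):
        self.rows.append((edge_name, row))


class Edge:
    """Edge cluster: serves local writes and forwards them one way to the hub."""

    def __init__(self, name, hub):
        self.name = name
        self.hub = hub
        self.local_rows = []    # data that can be queried at the edge
        self.pending = deque()  # writes not yet delivered to the hub
        self.connected = False  # hypothetical link-state flag

    def write(self, row):
        # The local write succeeds whether or not the hub is reachable.
        self.local_rows.append(row)
        self.pending.append(row)
        self.forward()

    def forward(self):
        # Drain the backlog in arrival order while the link is up.
        while self.connected and self.pending:
            self.hub.receive(self.name, self.pending.popleft())


hub = Hub()
store_a = Edge("store_a", hub)
store_b = Edge("store_b", hub)

store_a.write({"item": "coffee"})  # offline: stored at the edge only
store_b.connected = True
store_b.write({"item": "tea"})     # online: forwarded immediately
store_a.connected = True
store_a.forward()                  # link restored: backlog is delivered
```

Nothing flows from the hub back to an edge here, which is the one-way property; each edge remains fully usable for local queries in the meantime.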
Example: Retail Sales
• Each Store
  • Can Query Local Sales: “What did Brian buy here today?”
  • Analytics of Local Sales: “What was the hottest product here this week?”
• Central Hub
  • Can Query Global Sales: “What did Brian buy today across all stores?”
  • Analytics Over All Data: “What was Brian’s average purchase per store this week?”
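The split between store-level and hub-level questions can be sketched in Python. The rows, store names, and helper functions below are hypothetical, and the sketch ignores the “today” and “this week” time windows for brevity; it only shows that the same query gives a local answer at a store and a global answer at the hub, which holds the replicated union of all stores.

```python
from collections import defaultdict

# Hypothetical sales rows: (store, customer, product, amount)
store_1 = [("store_1", "Brian", "coffee", 4.0), ("store_1", "Ana", "tea", 3.0)]
store_2 = [("store_2", "Brian", "bagel", 2.0), ("store_2", "Brian", "coffee", 4.0)]

# The hub holds the union of what every store has replicated to it.
hub = store_1 + store_2


def purchases(rows, customer):
    """Products a customer bought in the given set of rows."""
    return [product for _, c, product, _ in rows if c == customer]


def average_purchase_per_store(rows, customer):
    """Mean purchase amount for a customer, grouped by store."""
    amounts = defaultdict(list)
    for store, c, _, amount in rows:
        if c == customer:
            amounts[store].append(amount)
    return {store: sum(v) / len(v) for store, v in amounts.items()}


local_answer = purchases(store_1, "Brian")    # asked at one store
global_answer = purchases(hub, "Brian")       # asked at the hub
per_store_avg = average_purchase_per_store(hub, "Brian")
```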
Key Verticals
Architecture – Edge View
• Components: Client; Edge (Table, Collection Agent, Replication Log, Replication Agent); Hub Cluster (Table)
• Client to Edge Table: Normal CQL Operation
• Collection Agent: CQL Trigger captures mutation
• Replication Log: Maintained in C* table for Fault Tolerance
• Replication Agent: Pulls from Replication Log in priority/time order
• Replication Agent to Hub Cluster Table: Replicates to Hub via normal CQL driver
• High Priority mutations opportunistically sent to Hub asynchronously
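The “priority/time order” pull described on this slide can be illustrated with a small Python sketch. It is an in-memory stand-in only: the slide says the real log lives in a Cassandra table, and the `ReplicationLog` class, the `replicate` function, the `budget` parameter, and the convention that a lower number means higher priority are all assumptions introduced here.

```python
import heapq
import itertools


class ReplicationLog:
    """In-memory stand-in for the replication log."""

    def __init__(self):
        self._heap = []
        self._seq = itertools.count()  # arrival counter, used as the time order

    def append(self, priority, mutation):
        # Lower number = higher priority (an assumption of this sketch).
        heapq.heappush(self._heap, (priority, next(self._seq), mutation))

    def pop_next(self):
        priority, _, mutation = heapq.heappop(self._heap)
        return priority, mutation

    def __len__(self):
        return len(self._heap)


def replicate(log, send, budget):
    """Send up to `budget` mutations, highest priority first, oldest first within a priority."""
    sent = 0
    while len(log) > 0 and sent < budget:
        _, mutation = log.pop_next()
        send(mutation)
        sent += 1
    return sent


log = ReplicationLog()
log.append(2, "low-1")   # placeholder payloads, not real CQL mutations
log.append(1, "high-1")
log.append(2, "low-2")
log.append(1, "high-2")

hub = []
sent = replicate(log, hub.append, budget=3)  # limited bandwidth: only 3 go out
```

With a budget of three, both high-priority mutations reach the hub before any low-priority one, and the remaining entry stays in the log for the next pass.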
Topology
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra

Editor's Notes

  • #5 (Example: Retail Sales): This slide represents an example of retail point-of-sale transactions.
  • #6 (Key Verticals):
      • Oil and Gas
      • Industrial IoT
      • Retail
      • Banking, Finance
      • Telecommunications
      • Transportation
      • Mobile deployments or deployments with poor connectivity: oil rigs, mining, cruise ships, planes, etc.