When and how to migrate from a relational database to Cassandra
Introduction
• Ben Slater, Chief Product Officer, Instaclustr
• Cassandra as a managed service on AWS, Azure & IBM SoftLayer
• 20 years' experience as a developer, architect and dev team lead
© 2015. All Rights Reserved.
1 Introduction
2 When to consider migration
3 Preparing your application
4 Migration approaches
5 Conclusion
When to consider migration
• Reaching physical scalability limits
• Licensing costs becoming prohibitive
• Need 100% availability
• Increasing DBA time to maintain performance / availability
• Active/active multi-DC / disaster recovery requirements
• Weigh against costs:
- Initial migration
- Additional logic maintained in app (e.g. maintaining denormalised duplicate data)
Preparing your application
Some steps taken while still on the relational database can help reduce migration costs:
• Abstract data access layer (service oriented architecture)
• Denormalise within relational DB
• Minimise logic implemented in DB
• Build data validation checks & data profiles
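As a rough illustration of the "abstract data access layer" point, the sketch below has application logic depend on a small interface rather than on a database. The names `User`, `UserStore`, `InMemoryUserStore` and `change_email` are hypothetical and not from the slides; the in-memory class only stands in for a real relational or C* backend.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Protocol


@dataclass(frozen=True)
class User:
    user_id: str
    email: str


class UserStore(Protocol):
    """Hypothetical data access interface; the app depends only on this."""

    def get(self, user_id: str) -> Optional[User]: ...

    def save(self, user: User) -> None: ...


class InMemoryUserStore:
    """Stand-in backend. A relational or C* implementation would expose
    the same two methods, so callers would not need to change."""

    def __init__(self) -> None:
        self._rows: Dict[str, User] = {}

    def get(self, user_id: str) -> Optional[User]:
        return self._rows.get(user_id)

    def save(self, user: User) -> None:
        self._rows[user.user_id] = user


def change_email(store: UserStore, user_id: str, new_email: str) -> bool:
    """Application logic written against the interface, not a database."""
    user = store.get(user_id)
    if user is None:
        return False
    store.save(User(user_id=user.user_id, email=new_email))
    return True
```

With this shape, swapping the backend during migration should mostly mean writing another class with the same `get` and `save` methods.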
Migration approaches
• Big bang cutover
• Parallel run
• Table by table
Big bang cutover
• Build & test a version of the app that uses Cassandra (C*), plus the conversion of data from relational to C*
• Shut down relational, convert data, start up on Cassandra
• Requires downtime and is high risk, but is likely the lowest-effort option
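The "convert data" step might look something like the sketch below, which joins two normalised tables into one denormalised row per order. The schema, the sample data and the function names are made up for illustration; `sqlite3` stands in for the relational source and a plain dict stands in for the C* table.

```python
import sqlite3
from typing import Dict, Tuple


def build_source() -> sqlite3.Connection:
    """Create a tiny, made-up relational source (illustrative schema)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT);
        CREATE TABLE orders (
            id INTEGER PRIMARY KEY,
            customer_id INTEGER REFERENCES customers(id),
            total REAL
        );
        INSERT INTO customers VALUES (1, 'Ada'), (2, 'Grace');
        INSERT INTO orders VALUES (10, 1, 25.0), (11, 1, 40.0), (12, 2, 5.5);
        """
    )
    return conn


def convert(conn: sqlite3.Connection) -> Dict[Tuple[int, int], dict]:
    """Join the normalised tables into one denormalised row per order.

    The dict key (customer_id, order_id) plays the role of a primary key
    with customer_id as partition key and order_id as clustering column.
    """
    rows = conn.execute(
        """
        SELECT c.id, o.id, c.name, o.total
        FROM orders AS o JOIN customers AS c ON c.id = o.customer_id
        ORDER BY c.id, o.id
        """
    )
    target: Dict[Tuple[int, int], dict] = {}
    for customer_id, order_id, name, total in rows:
        # The customer name is duplicated onto every order row on purpose.
        target[(customer_id, order_id)] = {"customer_name": name, "total": total}
    return target
```

In a real cutover the loop body would write to C* instead of a dict, and the run time of this step largely determines the length of the downtime window.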
Parallel run
• Build C* tables
• Modify application to write to both C* and relational
• Develop & run a tool to perform the initial sync and reconciliation of the two databases
• Run and regularly reconcile
• Migrate reads to C*
• More complex to build and manage
• Lower risk and can be done with no downtime
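The parallel-run steps above can be sketched as follows. This is a minimal model under stated assumptions: both databases are represented by plain dicts, `DualWriter` and `reconcile` are hypothetical names, and the relational side is treated as the source of truth during reconciliation. Rows deleted from relational are not handled here.

```python
from typing import Dict, List, Optional


class DualWriter:
    """Writes go to both stores; reads stay on the relational side
    until `read_from_new` is switched on."""

    def __init__(self, relational: Dict[str, dict], cassandra: Dict[str, dict]):
        self.relational = relational
        self.cassandra = cassandra
        self.read_from_new = False

    def write(self, key: str, row: dict) -> None:
        self.relational[key] = dict(row)
        self.cassandra[key] = dict(row)

    def read(self, key: str) -> Optional[dict]:
        source = self.cassandra if self.read_from_new else self.relational
        return source.get(key)


def reconcile(relational: Dict[str, dict], cassandra: Dict[str, dict]) -> List[str]:
    """Copy rows that are missing or different in the new store and
    return their keys in sorted order (relational wins in this sketch)."""
    repaired = []
    for key in sorted(relational):
        if cassandra.get(key) != relational[key]:
            cassandra[key] = dict(relational[key])
            repaired.append(key)
    return repaired
```

The first `reconcile` call doubles as the initial sync; once repeated runs return an empty list, flipping `read_from_new` corresponds to the "migrate reads to C*" step.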
Table by table / function by function
• Either big-bang or parallel run approaches can be done on a table-by-table basis
• Need to be able to isolate subject areas with minimal joins in the relational DB (likely to correspond to denormalised C* tables)
• Allows staged implementation, gradually moving load from relational to C*; useful if the relational environment is under immediate capacity pressure
• Incrementally reduce pressure on relational
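One way to picture the staged approach is a small router that records which tables have moved and tells the data access layer where each one lives. `TableRouter` and the table names in the usage note are hypothetical, and this is only a sketch of the bookkeeping, not of the data movement itself.

```python
from typing import Set


class TableRouter:
    """Tracks which tables (subject areas) have been moved to C*."""

    def __init__(self, tables: Set[str]):
        self._tables = set(tables)
        self._migrated: Set[str] = set()

    def _check(self, table: str) -> None:
        if table not in self._tables:
            raise KeyError(f"unknown table: {table}")

    def mark_migrated(self, table: str) -> None:
        self._check(table)
        self._migrated.add(table)

    def backend_for(self, table: str) -> str:
        self._check(table)
        return "cassandra" if table in self._migrated else "relational"

    def progress(self) -> float:
        """Fraction of tables migrated so far (0.0 when no tables exist)."""
        if not self._tables:
            return 0.0
        return len(self._migrated) / len(self._tables)
```

For example, a router built with `{"sessions", "orders"}` would report `"relational"` for both tables until `mark_migrated("sessions")` is called, after which only `orders` still loads the relational database.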
Estimating Guide
Work Items
• Revise & test operational procedures
• Performance test & soak test
• Trial conversions
• Execute production migration
• Application changes & regression test
• Build migration tool
• Build reconciliation tool
• Build C* schema
Effort Drivers
• Number of source tables
• Number of access paths
• Migration approach
• Level of “preparedness” (see “Preparing your application”)
Considerations
• Don’t forget analytics/ad-hoc querying requirements
• Denormalise – it should feel wrong
• Keep in mind common C* data modelling traps:
– Partition keys
– Tombstones
– Secondary indexes
• Make sure your reads work before migrating / writing
• Upserts make migration easier
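On the last point: a C* write inserts the row if the key is new and overwrites the supplied columns otherwise, so a migration batch can be replayed without creating duplicates. The sketch below models that behaviour with a dict; `apply_batch` and the sample rows are illustrative assumptions, not a C* client API.

```python
from typing import Dict, Iterable, Tuple

Row = Tuple[str, dict]  # (primary key, column values)


def apply_batch(table: Dict[str, dict], batch: Iterable[Row]) -> None:
    """Upsert each row: insert if the key is new, otherwise overwrite the
    supplied columns. Columns not mentioned are left untouched."""
    for key, columns in batch:
        table.setdefault(key, {}).update(columns)
```

Because applying the same batch twice leaves the table unchanged, a migration or reconciliation tool that fails part-way can simply be rerun from the start of the batch.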
Conclusion
• It has been done!
• Putting it off won’t make it any easier!
Thank you

