Benjamin Cefalo, Lead Product Manager, MongoDB
Modern Data Backup and Recovery from On Premises to the Public Cloud
@bencefalo
Safe Harbor Statement
This presentation contains “forward-looking statements” within the meaning of Section 27A of the Securities Act of 1933,
as amended, and Section 21E of the Securities Exchange Act of 1934, as amended. Such forward-looking statements are
subject to a number of risks, uncertainties, assumptions and other factors that could cause actual results and the timing of
certain events to differ materially from future results expressed or implied by the forward-looking statements. Factors that
could cause or contribute to such differences include, but are not limited to, those identified in our filings with the Securities
and Exchange Commission. You should not rely upon forward-looking statements as predictions of future events.
Furthermore, such forward-looking statements speak only as of the date of this presentation.
In particular, the development, release, and timing of any features or functionality described for MongoDB products
remains at MongoDB’s sole discretion. This information is merely intended to outline our general product direction and it
should not be relied on in making a purchasing decision nor is this a commitment, promise or legal obligation to deliver any
material, code, or functionality. Except as required by law, we undertake no obligation to update any forward-looking
statements to reflect events or circumstances after the date of such statements.
Quick level set — We’re not going to talk about
disaster recovery
[Diagram: Application, Driver, and a replica set of one Primary and two Secondaries, labeled Replication]
But what happens when you have
data corruption in your primary or
someone accidentally does
something they shouldn’t have... ?
Data Recovery - That’s what we’re going to focus
on today
MongoDB Offerings
Private DBaaS: On-Prem
Hybrid DBaaS
Public DBaaS: Fully Managed
Built on the Same Code Base, Same API, Same Management UI
The Future of Backup for Self Managing or On-Premises
On-Premises Backup Architecture Today
[Diagram components: Ops Manager; an OM Group of six hosts (OM01 to OM06) with six Backup Agent and Monitoring Agent pairs; BackupLoadBalancer; three App Servers for Backup; three Backup Daemons, each shown with a HeadDB; Blockstore01 to Blockstore06; Oplog01 to Oplog06]
Backup Strategy
● Reduce the footprint and architectural complexities needed for backup
● Reduce cost
● Improve performance
WiredTiger Checkpoints
What is it?
● Enhanced WiredTiger to take checkpoints of itself
● Checkpoints are then moved to long-term storage
● HeadDBs are completely eliminated: no more initial syncs!
○ Reduces storage requirements
○ Reduces architectural complexities
○ Reduces infrastructure required
● Consolidated agents, from three agents to one
New Backup Architecture
[Diagram components: Ops Manager; an OM Group of six hosts (OM01 to OM06), each with a single MongoDB Agent; BackupLoadBalancer; two combined App Server/Backup Daemon nodes; Oplog01 and Oplog02; Blockstore; S3 Snapshot Store]
Backup Node Selection Order
1. Hidden Secondaries
2. A secondary we have already taken a snapshot from
3. Secondary closest to the time of the snapshot
4. Any available secondary
5. Primary
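The selection order above can be sketched as a simple priority cascade. This is only an illustration of the ordering on the slide, not Ops Manager's actual implementation: the `Node` class, its field names, and `pick_backup_node` are hypothetical, and "closest to the time of the snapshot" is modeled here as the smallest known lag in seconds.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Node:
    # All fields are hypothetical names chosen for this sketch.
    name: str
    is_primary: bool = False
    is_hidden: bool = False
    previously_snapshotted: bool = False
    lag_seconds: Optional[float] = None  # distance from the snapshot time, if known
    available: bool = True


def pick_backup_node(nodes: List[Node]) -> Optional[Node]:
    """Return the preferred node to snapshot, following the slide's order."""
    candidates = [n for n in nodes if n.available]
    if not candidates:
        return None
    secondaries = [n for n in candidates if not n.is_primary]

    # 1. Hidden secondaries
    hidden = [n for n in secondaries if n.is_hidden]
    if hidden:
        return hidden[0]
    # 2. A secondary we have already taken a snapshot from
    reused = [n for n in secondaries if n.previously_snapshotted]
    if reused:
        return reused[0]
    # 3. Secondary closest to the time of the snapshot
    timed = [n for n in secondaries if n.lag_seconds is not None]
    if timed:
        return min(timed, key=lambda n: n.lag_seconds)
    # 4. Any available secondary
    if secondaries:
        return secondaries[0]
    # 5. Primary (the only kind of node left at this point)
    return candidates[0]
```

The cascade only falls back to the primary when no secondary is available, which keeps snapshot work away from the node serving writes whenever possible.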
Phase 1 Details
• Backup agent required to exist on every node
• OM 4.2 required for backing up 4.2
• Backwards compatible for 4.0 and below
• Daemons are used for background tasks
• Replica sets only, sharded clusters coming later
• Advanced features coming later
New Backup Architecture: Future
[Diagram components: Ops Manager; an OM Group of six hosts (OM01 to OM06), each with a single MongoDB Agent; BackupLoadBalancer; two App Servers for Backup; Oplog01; S3 Oplog; Blockstore; S3 Snapshot Store]
Future Improvements
● Agents read from and write directly to the oplog store and the snapshot store
New Backup Architecture: Future
[Diagram components: Ops Manager; an OM Group of six hosts (OM01 to OM06), each with a single MongoDB Agent; S3 Oplog; S3 Snapshot Store]
Future Improvements
● Agents read from and write directly to the oplog store and the snapshot store
● Leave a checkpoint behind
What this means for you...
● Fewer people and less time and energy needed to deliver a best-in-class data
recovery solution for your on-premises and self-managed MongoDB
deployments
Atlas
Atlas: Same Features as Ops Manager & Cloud Manager
Private DBaaS: On-Prem
Hybrid DBaaS
Public DBaaS: Fully Managed
Built on the Same Code Base, Same API, Same Management UI
The Latest MongoDB Features
• MongoDB Atlas comes out-of-the-box with
MongoDB 3.4, 3.6, 4.0
• Transactions (4.0)
• Change Streams (3.6)
• JSON Schema (3.6)
• Expressive nested array updates (3.6)
• Expressive joins: $lookup (3.6)
• Graph queries (3.4)
• Facets & expressive aggregations (3.4)
• Minor updates and major upgrades without
downtime
● Data Explorer
○ Inspect schema & index utilization
● Real-Time Performance Panel
○ Live telemetry: in-flight operations & resource
consumption
● Performance Advisor
○ Always-on index recommendations
Multi-AZ Disaster Recovery by Default
[Diagram: AZ I, AZ II, AZ III]
...with Optional Multi-Region Fault Tolerance
[Diagram: Preferred Region with AZ I, AZ II, AZ III; two Failover Regions, each with AZ I and AZ II]
Atlas Backups: Data Recovery
Roll back the clock when you run
into issues triggered by user or
application errors that are
replicated from the primary to
the rest of your cluster.
Point-in-Time Data Recovery
• Lets you select a restore time based on your PIT window
• Restores the closest snapshot and rolls ahead
• Reduces the possibility of data loss
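The "restore the closest snapshot and roll ahead" step can be sketched as follows. This is a minimal illustration, not Atlas's implementation: `plan_point_in_time_restore` and its arguments are hypothetical, timestamps are plain numbers, and "closest" is assumed to mean the most recent snapshot taken at or before the requested time, since rolling ahead needs a base that is not later than the target.

```python
from bisect import bisect_right


def plan_point_in_time_restore(snapshot_times, oplog, target_time):
    """Pick a base snapshot and the oplog entries to replay on top of it.

    snapshot_times: sorted list of snapshot timestamps
    oplog: list of (timestamp, operation) tuples
    target_time: the point in time the user wants to restore to
    """
    # Index of the last snapshot taken at or before the target time.
    idx = bisect_right(snapshot_times, target_time) - 1
    if idx < 0:
        raise ValueError("no snapshot at or before the target time")
    base = snapshot_times[idx]
    # Roll ahead: replay only operations after the snapshot, up to the target.
    to_replay = [op for ts, op in oplog if base < ts <= target_time]
    return base, to_replay


snapshots = [100, 200, 300]
oplog = [(150, "a"), (210, "b"), (250, "c"), (260, "d")]
base, ops = plan_point_in_time_restore(snapshots, oplog, target_time=250)
print(base, ops)  # 200 ['b', 'c']
```

A target earlier than the first snapshot raises an error here, which mirrors the slide's point that the restore time must fall inside the available PIT window.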
What About Small Disasters?
• The application is working fine
• But data is missing or has
been altered
• No time to do a full restore
Queryable Backups
• Ability to query your snapshots and
restore data at the document level in
minutes.
• Reduces the operational overhead
associated with:
• Identifying whether data of interest has
been altered
• Pinpointing the best point in time to
restore a database
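Sample Queryable Script
from pymongo import MongoClient

# Placeholder connection strings (assumptions, not from the slide):
# `source` is the queryable snapshot, `destination` is the live cluster.
source = MongoClient("mongodb://localhost:27017")
destination = MongoClient("mongodb://localhost:27018")

db = source.locations
db2 = destination.locations
zips = db.zipcodes
zips2 = db2.zipcodes

def restore():
    print("Finding Missing Data")
    query = {'state': 'NY'}
    try:
        # Copy each matching document from the snapshot into the live cluster.
        for doc in zips.find(query):
            zips2.insert_one(doc)
    except Exception as e:
        print("Unexpected error:", type(e), e)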
Demo
Continuous Backup Regions
Cloud Provider Snapshots (CPS)
• Utilizes each provider's native snapshot capabilities
• Granular backup region selection
• Satisfy data sovereignty requirements
• Supports replica sets and sharded clusters
• Pricing is based on snapshot size, not data size
• Less expensive, starting at $0.08 per GB of snapshot
size (varies per provider and region)
• Now available on all cloud providers!
• AWS and GCP snapshots are incremental
Incremental snapshots for Atlas customers deploying on AWS:
10 GB, then 15 GB (5 GB new), then 20 GB (5 GB new)
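To see why pricing on snapshot size favors incremental snapshots, the 10 GB, 15 GB, 20 GB sequence above can be worked through in a few lines. This is a rough sketch: the function name is hypothetical, the $0.08 per GB rate is the "starting at" figure from the slide (it varies by provider and region), and the billing period is not stated in the deck, so the dollar figures are per whatever period that rate applies to.

```python
RATE_PER_GB = 0.08  # "starting at" rate from the slide; varies per provider and region

# (total data size in GB, new data since the previous snapshot in GB)
snapshots = [(10, 10), (15, 5), (20, 5)]


def stored_gb(snapshots, incremental):
    """Total GB stored across all snapshots."""
    if incremental:
        # Each snapshot stores only the data that is new since the last one.
        return sum(new for _total, new in snapshots)
    # Full snapshots store the whole data set every time.
    return sum(total for total, _new in snapshots)


full = stored_gb(snapshots, incremental=False)        # 10 + 15 + 20 = 45
incremental = stored_gb(snapshots, incremental=True)  # 10 + 5 + 5 = 20

print(full, incremental)  # 45 20
print(round(full * RATE_PER_GB, 2), round(incremental * RATE_PER_GB, 2))
```

In this example the three incremental snapshots together occupy the same 20 GB as the latest data set, rather than the 45 GB that three full copies would take.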
Cloud Provider Snapshots (CPS) Updates — Continued
• Bring your own keys
• AWS KMS
• Azure Key Vault
• GCP KMS
• Backup Policies
• On-demand snapshots (quicksave)
• Takes a snapshot immediately if there’s not already one in progress
• API for pipeline integrations
• M2/M5 Backups (Coming Soon!)
Continuous Backup Regions
Snapshot Backup Regions
Cloud Provider Snapshots
Demo
What's Next?
Cloud Provider Snapshots - Future
● Point in Time Restores with customizable window
● Build a new cluster from a backup
● Queryable Backup with Selective restore
Pick the fully managed backup solution that’s right for you
Within Atlas is a modern data recovery platform that enables you to choose how you want to back up your
data, depending on your requirements.
Continuous
● Point-in-time restore
○ All but eliminate the possibility of data
loss
● Queryable snapshots
○ Rapidly restore granular data
Cloud Provider Snapshots
● Localized backup
○ Satisfy data sovereignty requirements
● Bring your own keys
● Take snapshots on-demand
● The cost-effective option
MongoDB World 2019: Modern Data Backup and Recovery from On-premises to the Public Cloud
